AUTOMOTIVE IMMERSIVE SOUND

An Overview

Soundscape Design Developement

Pre

Project Strategy Design

  • Definition of Automotive Sound Targets
  • Impulse Response Positions and Conditions

01.

Acoustic Fingerprint of the Vehicle

  • Multichannel Recording Solutions
  • Evaluation of the Statistical Descriptors

02.

Virtual Acoustic Environment

  • Virtual Environment Implementation
  • Synthesis
  • Tuning

03.

In-situ Installation

  • Tests

Physical

  • Aerodynamic noise
  • Tier/road


Mechanical

  • ICE engine system
  • Air-conditioning
  • Doors opening
  • Gadget sounds

Electrical

  • EV engine system
  • Battery cooling


Acoustic Analysis Methodology

Modal

Method applies to simple systems such as string, pipe or walls




Geometric

Propagation of the acoustic field is treated as if it were governed by rays




Diffuse Field

Acoustic field is evaluated through the hypothesis that after some time from the initial conditions, the energy has spread and distributed homogeneously throughout the space

01. Acoustic Fingerprint of the Vehicle

Multichannel Recording Solutions

Direct

Manually arranging the setup based on heuristics. Will not be further detailed because microphone and loudspeaker layouts are interdependent

Parametric

Process of microphone array signals in the time-frequency domain and estimate spatial parameters based on sound field models

Non-parametric

Linear transformation such as ambisonic. Input channels are encoded in the spherical harmonic. Free loudspeaker layouts

02. Virtual Acoustic Environment

Evaluation of the Statistical Descriptors



Statistical Descriptors

Time required for the impulse response to decay by 60 dB.

Reference time of the first reflections.

Delay time of the first reflection, i.e. the time between the direct sound and the first reflection. Assuming of course that there is direct sound. The optimal values ​​for the ITDG vary depending on the space. Typically for speech, an ITDG of less than 50ms is preferred. Magic value for ITDG is around 30 ms.

If the contributions to the direct sound arrive within 35 ms, therefore the ITDG is equal to or less than this value, all contents are integrated as part of the direct sound. These events are incorporated as the same sound event. If the contributions arrive after an interval of about 80 ms, the uniformity of a sound is broken and we hear the echo. This size is very important, because it is one of those parameters called intimacy. Above a certain value we will have the sensation of a larger environment. From a qualitative point of view, it is important that the first reflections come from the side walls to have a greater pleasantness of the space. This is called lateral asymmetry or interaural coherence.

It expresses a relationship between reverberation times taken at specific frequencies. This value gives us indications of the fullness of the sound in the medium-low register. Typically the best rooms are those that have a BR between 1.2 and 1.25. (BR = RT(125Hz) + RT(250Hz) / RT(500Hz) + RT(1kHz))

It has a similar definition to BR but on different frequencies. Brilliance = RT(500Hz) + RT(1kHz) / RT(2kHz) + RT(4kHz) Brilliance and Bass Ratio describe the brightness of the environment on an acoustic level, i.e. how much medium-low and medium-high contribution there is. These two parameters are obviously linked to the materials of the space. Very smooth and rigid materials increase the Brilliance parameters, while wood contributes to BR. Wood has the ability to preserve the medium-low region of the spectrum (depending on the essence and finish of the wood it changes a lot for the resonator qualities). These two parameters are evaluated for acoustic adaptation works.



Statistical Descriptors Related to Energy

Ratio between useful energy and total energy. The useful energy in this case ranges from 0 to 50 ms, while the total energy ranges from 0 to infinity. In the case of speech the D50 must be greater than 0.5. In the first 50ms there must be at least half of the total energy for speech to be intelligible.

They are defined as the ratio of useful energy to harmful energy. Now the measures are logarithmic. They are used for the intelligibility of speech (C50) and music (C80). The C50 needs all articulations to be audible. For music the C80 is fine as the articulations are slower than speech. The level of intelligibility can be even lower than speech. Optimal values ​​for the C50 and C80.

Is one of the most important characteristics evaluated. To have a correct sensation of spatiality, the ITDGs must not exceed 80 ms and must reach the listener from lateral directions, because this increases the perception of asymmetry of space. The intensity relationships between first reflections and direct sound must respect the laws of the precedence effect (Haas effect). From a frequency point of view, the most important frequency bands for the spatial impression are those with the octave band at 125 Hz (warm sound) and the part at 1000 Hz (opening band), which contributes to giving a spatial presence of the sound, higher presence front. While the 125 Hz band helps create an enveloping sound, the band around 1000 Hz is linked to the feeling of openness of the sound. The impression of spatiality is closely linked to interaural coherence. Low interaural coherence from a larger spatial environment. Asymmetric space = low interaural coherence. We like symmetry from a spatial point of view, but from an acoustic point of view we don't like it very much. The more different the signals that reach the two ears, the more we seem to be in an open space. There are several parameters for the coherence of the signal between two ears, including ITACC or inter aural cross correlation. All these parameters are generally recorded with Dummy-Heads.

This definition starts from the assumption that useful energy decays in the first 50/80 ms. But not in all situations this data is the same. Therefore another parameter is defined, namely the Center Time: barycentric time of the response to the impulse, i.e. it is the time that separates 50% of the energy before an instant and after an instant, therefore identifying half of an IR from the energetic point of view, therefore the time spent which, I have been in dress from 50% of the energy. Smaller ts -> more compact IR -> energy on first reflections. More right ts -> longer reverberation, energy spread on the diffuse part (tail). An environment that is too dry is not perceived as comfortable, so it is good that a certain environment has a certain spatiality, although it is good that the contribution is concentrated at the beginning.

Mmeasure of the acoustic amplification introduced through an amplification apparatus compared with the response of the environment to a sound source without amplification acoustics (similar to the concept of transparent amplification, i.e. an attitude that depends on various factors). The acoustic reinforcement index gives indications on an acoustic apparatus that you do not want to interfere too much with the natural source. there is a standard that defines two possible measurement criteria for this acoustic reinforcement index: background noise and the energy that arrives from the amplified source in relation to the energy that arrives without amplification at 10 meters away. G >= -4dB for rooms intended for music; G >= +4dB for rooms intended for speech (intelligibility).

Synthesis of Digital Sounds

03. In-situ Installation





Thank You!



BIBLIOGRAPHY

Acoustic & Audio Engineering
Automotive

Site was started with Mobirise