University of Parma

(1)

Angelo Farina Angelo Farina

Industrial Engineering Dept.

University of Parma

University of Parma - - ITALY ITALY HTTP://

HTTP:// www.angelofarina.it www.angelofarina.it

University of Parma

ROOM IMPULSE ROOM IMPULSE

RESPONSES AS TEMPORAL RESPONSES AS TEMPORAL

AND SPATIAL FILTERS

(2)

Traditional time-domain measurements with omnidirectional transducers

Advanced impulse response measurement methods

Directional transducers, the first attempts of spatial analysis

Orthonormal decomposition of the spatial properties in spherical harmonics: the Ambisonics method

The reciprocity principle: directive microphones and directive sources

Generalization of higher-order spherical harmonics representation of both source and receiver directivity

Joining time and space: from Einstein’s view to a

comprehensive data structure representing the acoustical transfer function of a room

Practical usages of measured (or numerically simulated) temporal-spatial impulse response

Topics

(3)

Basic

Basic sound sound propagation propagation scheme scheme

Omnidirectional receiver Direct Sound

Reflected Sound

Point Source

Direct Sound

Reverberant tail

(4)

Traditional

Traditional measurement measurement methods methods

PulsivePulsive sources: sources: ballonsballons, , blankblank pistolpistol

(5)

Modern

Modern electroacoustical electroacoustical methods methods

The sound is The sound is generated generated by by means means of of an an omnidirectional omnidirectional loudspeaker

loudspeaker

The The signal signal is is computer- computer -generated generated

The The same same computer is computer is also also employed employed for for recording recording the the room room ’response ’ response by by means means of of one one or more omnidirectional or more omnidirectional microphones

microphones

Also Also directive directive microphones microphones can be can be used: used : binaural binaural, , figure figure- -of of -eight - eight

Different Different types types of of test signals test signals have have been been developed developed , providing , providing good good immunity immunity to to background noise background noise and easy deconvolution and easy deconvolution of of the impulse the impulse response: response :

► MLS (Maximum Lenght Sequence, pseudo-random white noise)

► TDS (Time Delay Spectrometry, which basically is simply a linear sine sweep, also known in Japan as “stretched pulse”)

► ESS (Exponential Sine Sweep)

Each Each of of these these test signals test signals can be can be employed employed with with different different deconvolution

deconvolution techniques techniques , resulting , resulting in a number in a number of of “different “ different” ” measurement

measurement methods methods

Due to Due to theoretical theoretical and practical and practical considerations, the considerations , the preference preference is is nowadays

nowadays generally generally oriented oriented for for the usage the usage of of ESS with ESS with not- not -circular circular deconvolution

deconvolution

(6)

Measurement process

The desidered result is the linear impulse response of the acoustic propagation h(t). It can be recovered by knowing the test signal x(t) and the measured system output y(t).

It is necessary to exclude the effect of the not-linear part K and of the background noise n(t).

Not-linear, time variant

system K[x(t)]

Noise n(t)

input x(t)

+

output y(t) linear system

w(t)⊗h(t) distorted signal

w(t)

(7)

Edirol FA-101 Firewire sound

card:

10 in / 10 out 24 bit, 192 kHz ASIO and WDM

Hardware: PC and audio interface

(8)

Hardware:

Hardware: loudspeaker loudspeaker & & microphone microphone

Dodechaedron loudspeaker Omnidirectional

microphone

(9)

Aurora Plugins

Generate MLS Generate MLS

Deconvolve DeconvolveMLSMLS

Generate Sweep Generate Sweep

Deconvolve

DeconvolveSweepSweep

Convolution Convolution

Kirkeby

KirkebyInverse FilterInverse Filter

Speech

Speech TransmTransm. Index. Index

Software

(10)

MLS MLS method method

X(t) is X(t) is a periodic a periodic binary binary signal

signal obtained obtained with with a a suitable

suitable shift- shift -register register, , configured

configured for for maximum

maximum lenght lenght of of the the period

period. .

N stages

XOR k stages

x’(n)

1 2

L = ^N −

(11)

MLS MLS deconvolution deconvolution

The The re re - - recorded recorded signal signal y(i) y(i) is is cross cross - - correlated correlated with with the the excitation

excitation signal signal thanks thanks to to a fast Hadamard a fast Hadamard transform. The transform . The result

result is is the the required required impulse impulse response response h(i), if h(i), if the system the system was was linear linear and time and time- -invariant invariant

y 1 M

L h 1

~ ⋅ + ⋅

=

z

Where M is the Hadamard matrix, obtained by permutation of the original MLS sequence m(i)

( )

[ ⁱ ^j ² ^mod ^L ] ¹

m )

j , i (

M ~ = + − −

(12)

MLS MLS example example

Portable PC with 4- channels sound board Original Room

SoundField Microphone

B- format 4- channels signal

(WXYZ)

Measurement of B- format Impulse Responses

MLS excitation signal

Portable PC with additional sound card Room

Microphone

Output signal y Measurement of room

Impulse Response

MLS test signal x Loudspeaker

(13)

MLS MLS example example

(14)

Exponential Sine Sweep method

x(t) is a sine signal, which frequency is varied

exponentially with time, starting at f

₁

and ending at f

₂

.

⎥ ⎥

⎦

⎤

⎢ ⎢

⎣

⎡

⎟⎟

⎟ ⎟

⎠

⎞

⎜⎜

⎜ ⎜

⎝

⎛

−

⋅

⎟⎟ ⎠

⎜⎜ ⎞

⎝

⎛

⋅

⋅ π

= ⋅ ^⎟⎟ ^⎠

⎜⎜ ⎞

⎝

⋅ ⎛

1 e

f ln f

T f

sin 2 )

t (

x ¹

2 f ln f T

t

1

2

1

(15)

Test Signal – x(t)

(16)

Measured signal - y(t)

The notThe not-linear-linear behaviourbehaviourofofthe loudspeakerthe loudspeaker causescauses manymanyharmonicsharmonics toto appear

appear

(17)

Inverse Filter – z(t)

The The deconvolutiondeconvolution ofof the IR isthe IR is obtainedobtained convolvingconvolving the the measuredmeasured signalsignal y(t)

y(t) withwith the inverse filterthe inverse filter z(t) [equalizedz(t) [equalized, , time-time-reversedreversed x(t)]x(t)]

(18)

Deconvolution of Log Sine Sweep

The “time reversal mirror” technique is employed: the system’s impulse response is obtained by convolving the measured signal y(t) with the time-reversal of the test signal x(-t). As the log sine sweep does not have a “white” spectrum, proper equalization is required

Test Signal x(t) Inverse Filter z(t)

(19)

Result of the deconvolution

The last impulse response is the linear one, the preceding are the harmonics distortion products of various orders

2° 1°

5° 3°

(20)

IR IR Selection Selection

AfterAfter the sequencethe sequence ofofimpulseimpulse responsesresponseshashas beenbeen obtained, obtained, itit isis possiblepossible toto selectselectand and insulateinsulatejust just oneoneofof them:them:

(21)

ESS ESS example example

B- format 4- channels signal

(WXYZ)

Measurement of B- format Impulse Responses

MLS excitation signal

Portable PC with additional sound card Room

Microphone

Output signal y Measurement of room

Impulse Response

Sweep test signal x Loudspeaker

(22)

ESS example

(23)

Maximum Length Sequence vs. Sweep

(24)

Post processing of impulse responses

A special plugin has been developed for the computation of STI according A special plugin has been developed for the computation of STI according to IEC

to IEC--EN 60268EN 60268--16:200316:2003

(25)

Post processing of impulse responses

A special plugin has been developed for performing analysis of acoustical A special plugin has been developed for performing analysis of acoustical parameters according to ISO

parameters according to ISO--33823382

(26)

The new AQT plugin for Audition

The new module is still under development and will allow for very fast The new module is still under development and will allow for very fast computation of the AQT (Dynamic Frequency Response) curve from w

computation of the AQT (Dynamic Frequency Response) curve from within ithin Adobe Audition

Adobe Audition

(27)

Spatial analysis by directive microphones Spatial analysis by directive microphones

The initial approach was to use directive microphones for gathering some The initial approach was to use directive microphones for gathering some information about the spatial properties of the sound field

information about the spatial properties of the sound field ““as perceived by as perceived by the listener

the listener””

Two apparently different approaches emerged: binaural dummy heads and Two apparently different approaches emerged: binaural dummy heads and pressure

pressure--velocity microphones:velocity microphones:

Binaural Binaural microphone (left) microphone (left)

and and

variable

variable--directivity directivity microphone (right) microphone (right)

(28)

“ “ objective objective ” ” spatial parameters spatial parameters

It was attempted to “It was attempted to “quantifyquantify”” the “the “spatialityspatiality”” of a room by means of of a room by means of

“objective“objective”” parameters, based on 2-parameters, based on 2-channels impulse responses measured channels impulse responses measured with directive microphones

with directive microphones

The most famous The most famous “spatial“spatial”” parameter is IACC (Inter Aural Cross parameter is IACC (Inter Aural Cross Correlation), based on binaural IR measurements

Correlation), based on binaural IR measurements

LeftLeft

Right Right

80 ms 80 ms

( )

( ) ( )

pp_L_L(τ(τ))

pp_R_R((ττ))

[ ] ( )

^t ^t

[

¹^ms^... ¹^ms

]

Max IACC

d t p

d p

d t p

p

t _E

ms 80

0 2R ms

80

0 2L ms 80

0

R L

+

−

∈ ρ

= τ

⋅ + τ

⋅ τ

τ

⋅ + τ

⋅ τ

= ρ

∫

(29)

“ “ objective objective ” ” spatial parameters spatial parameters

Other “Other “spatialspatial”” parameters are the Lateral Energy ratios: LE, LF, LFCparameters are the Lateral Energy ratios: LE, LF, LFC

These are defined from a 2-These are defined from a 2-channels impulse response, the first channel is a channels impulse response, the first channel is a standard omni microphone, the second channel is a

standard omni microphone, the second channel is a ““figurefigure--ofof--eighteight”” microphone:

microphone:

( ) ( )

∫ ( )

∫

τ

⋅ τ

τ

⋅ τ

=

80ms

ms 0

o2 ms 80

ms 5

o 8

d h

h LFC

Figure Figure

of 8 of 8 OmniOmni

( )

∫ ( )

∫

τ

⋅ τ

τ

⋅ τ

=

80ms

ms 0

o2 ms 80

ms 5

82

d h

LF

hh_o_o(τ(τ))

hh₈₈((ττ))

( )

∫ ( )

∫

τ

⋅ τ

τ

⋅ τ

=

80ms

ms 0

o2 ms 80

ms 25

82

d h

LE

(30)

Robustness of spatial parameters Robustness of spatial parameters

Both IACC and LF depend strongly on the orientation of the microphonesBoth IACC and LF depend strongly on the orientation of the microphones

Binaural and pressure-Binaural and pressure-velocity measurements were performed in 2 velocity measurements were performed in 2 theatres employing a rotating table for turning the microphones theatres employing a rotating table for turning the microphones

(1-LF) Auditorium Parma – Sorgente a sx

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

0 10

20 30

40 50

60 70

80 90 100 110 120 130 140 150 170 160 190 180

200 210 220 230 240 250 260 270 280 290

300 310

320 330

340 350 Sorgente

(1-LF) Auditorium Roma (Sala 1200) – Sorgente a sx

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7

0 10

20 30

40 50

60 70

80 90 100 110 120 130 140 150 170 160 200 190

210 220 230 240 250 260 270 280 290

300 310

320 330

340 350 Sorgente

Theatre 1-LF IACC Parma 0.725 0.266 Roma 0.676 0.344

IACC Auditorium Parma - Sorgente a sx

0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0

10 20 30

40 50

60 70

80 90 100 110 120 130 140 150 170 160 190 180

200 210 220 230 240 250 260 270 280 290

300 310

320

330 340 350 Sorgente

IACC Auditorium Roma (Sala 1200) - Sorgente a sx

0 0.05 0.1 0.15 0.2 0.25 0.3 0.35 0

10 20 30

40 50

60 70

80 90 100 110 120 130 140 150 170 160 200 190

210 220 230 240 250 260 270 280 290

300 310

320

330 340 350 Sorgente

(31)

Are binaural measurents reproducible?

Experiment performed in anechoic room Experiment performed in anechoic room --same loudspeaker, same source same loudspeaker, same source and receiver positions, 5 binaural dummy heads

and receiver positions, 5 binaural dummy heads

(32)

Are binaural measurents reproducible?

90°90° incidence incidence --at low frequency IACC is almost 1, at high frequency the at low frequency IACC is almost 1, at high frequency the difference between the heads becomes evident

difference between the heads becomes evident

IACCe - 90° incidence

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

31.5 63 125 250 500 1000 2000 4000 8000 16000

Frequency (Hz)

IACCe

B&K4100 Cortex Head Neumann

(33)

Are binaural measurents reproducible?

Diffuse field Diffuse field --the difference between the heads is now dramaticthe difference between the heads is now dramatic

IACCe - random incidence

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

31.5 63 125 250 500 1000 2000 4000 8000 16000

Frequency (Hz)

IACCe

B&K4100 Cortex Head Neumann

(34)

Are LF measurents reproducible?

Experiment performed in the Auditorium of Parma -Experiment performed in the Auditorium of Parma - same loudspeaker, same loudspeaker, same source and receiver positions, 5 pressure

same source and receiver positions, 5 pressure--velocity microphonesvelocity microphones

(35)

Are LF measurents reproducible?

At 7.5 m distance, the results already exhibit significant scatterAt 7.5 m distance, the results already exhibit significant scatter

Comparison LF - measure 1 - 7.5m distance

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

31.5 63 125 250 500 1000 2000 4000 8000 16000

Frequency (Hz)

LF

Schoeps Neumann Soundfield B&K

(36)

Are LF measurents reproducible?

At 25 m distance, the scatter is even larger....At 25 m distance, the scatter is even larger....

Comparison LF - measure 2 - 25m distance

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

31.5 63 125 250 500 1000 2000 4000 8000 16000

Frequency (Hz)

LF

Schoeps Neumann Soundfield B&K

(37)

3D extension of the pressure

3D extension of the pressure- -velocity measurements velocity measurements

The Soundfield microphone allows for simultaneous measurements of the The Soundfield microphone allows for simultaneous measurements of the omnidirectional pressure and of the three cartesian components o

omnidirectional pressure and of the three cartesian components of particle f particle velocity (figure

velocity (figure--ofof--8 patterns)8 patterns)

(38)

3D Impulse Response (Gerzon, 1975) 3D Impulse Response (Gerzon, 1975)

Sound Source

B-format 4-channels signal (WXYZ) Measurement of B-format

Impulse Responses

MLS or sweep excitation signal

Convolution of dry signals with the B-format Impulse

Responses Sound Source

Mono Mic.

B-format Imp. Resp.

of the original room

B-format 4-channels signal (WXYZ) Convolver

Ambisonics decoder

Speaker array in the reproduction room

(39)

The Waves project (2003) The Waves project (2003)

The original idea of Michael Gerzon was finally put in practice in 2003, The original idea of Michael Gerzon was finally put in practice in 2003, thanks to the Israeli

thanks to the Israeli--based company WAVESbased company WAVES

More than 50 theatres all around the world were measured, capturing 3D More than 50 theatres all around the world were measured, capturing 3D IRs (4

IRs (4-channels B-channels B--format with a Soundfield microphone)format with a Soundfield microphone)

The measurments did also include binaural impulse responses, andThe measurments did also include binaural impulse responses, and a a circular

circular--array of microphone positionsarray of microphone positions

More details on WWW.ACOUSTICS.NETMore details on WWW.ACOUSTICS.NET

(40)

Directivity of transducers Directivity of transducers

- 40 - 35 - 30 - 25 - 20 - 15 - 10 - 5 0

0

30

60

90

120

150 180

210 240 270

300 330

1000 Hz

-40 -35 -30 -25 -20 -15 -10 -5 0 0

30

60

90

120

150 180

210 240 270

300 330

2000 Hz

LookLine D200 dodechaedron LookLine D200 dodechaedron

- 40 - 35 - 30 - 25 - 20 - 15 - 10 - 5 0

0

30

60

90

120

150 180

210 240 270

300 330

250 Hz

- 40 - 35 - 30 - 25 - 20 - 15 - 10 - 5 0

0

30

60

90

120

150 180

210 240 270

300 330

4000 Hz

- 40 - 35 - 30 - 25 - 20 - 15 - 10 - 5 0

0

30

60

90

120

150 180

210 240 270

300 330

8000 Hz

- 40 - 35 - 30 - 25 - 20 - 15 - 10 - 5 0

0

30

60

90

120

150 180

210 240 270

300 330

16000 Hz

(41)

Directivity of transducers Directivity of transducers

Soundfield ST

Soundfield ST--250 microphone250 microphone

0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6

0

30

60

90

120

150 180

210 240 270

300 330

125 Hz

0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6

0

30

60

90

120

150 180

210 240 270

300 330

250 Hz

0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6

0

30

60

90

120

150 180

210 240 270

300 330

500 Hz

0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6

0

30

60

90

120

150 180

210 240 270

300 330

1000 Hz

0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6

0

30

60

90

120

150 180

210 240 270

300 330

2000 Hz

0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6

0

30

60

90

120

150 180

210 240 270

300 330

4000 Hz

0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6

0

30

60

90

120

150 180

210 240 270

300 330

8000 Hz

0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6

0

30

60

90

120

150 180

210 240 270

300 330

16000 Hz

(42)

What about source directivity ? What about source directivity ?

Current 3D IR sampling is still based on the usage of an “Current 3D IR sampling is still based on the usage of an “omnidirectionalomnidirectional”” source

source

The knowledge of the 3D IR measured in this way provide no information The knowledge of the 3D IR measured in this way provide no information about the soundfield generated inside the room from a directive

about the soundfield generated inside the room from a directive source source (i.e., a musical instrument, a singer, etc.)

(i.e., a musical instrument, a singer, etc.)

Dave Malham suggested to represent also the source directivity with a set Dave Malham suggested to represent also the source directivity with a set of spherical harmonics, called O

of spherical harmonics, called O--format format -- this is perfectly reciprocal to the this is perfectly reciprocal to the representation of the microphone directivity with the B

representation of the microphone directivity with the B--format signals format signals (Soundfield microphone).

(Soundfield microphone).

Consequently, a complete and reciprocal spatial transfer function can be Consequently, a complete and reciprocal spatial transfer function can be defined, employing a 4

defined, employing a 4--channels Ochannels O--format source and a 4format source and a 4--channels Bchannels B-- format receiver:

format receiver:

Portable PC with 4in-4out channels sound board O-format 4-channels source

B-format 4-channels microphone (Soundfield)

(43)

1st order MIMO impulse response 1st order MIMO impulse response

If only spherical harmonics of order 0 and 1 are taken into account, If only spherical harmonics of order 0 and 1 are taken into acco unt, a complete spatial transfer function measurement requires

a complete spatial transfer function measurement requires 16 impulse responses:

16 impulse responses:

{ } ^y [ ] ^h { } ^x

or x

x x x

h h

y y y y

z y x w

zz zy

zx zw

yz yy

yx yw

xz xy

xx xw

wz wy

wx ww

z y x w

⊗

=

⎪ ⎪

⎭

⎪ ⎪

⎬

⎫

⎪ ⎪

⎩

⎪ ⎪

⎨

⎧

⊗

⎥ ⎥

⎥

⎦

⎤

⎢ ⎢

⎢

⎣

⎡

=

⎪ ⎪

⎭

⎪ ⎪

⎬

⎫

⎪ ⎪

⎩

⎪ ⎪

⎨

⎧

Once these 16 IRs have been measured, it is possible to compute Once these 16 IRs have been measured, it is possible to compute the response of the room with a source and a receiver having

the response of the room with a source and a receiver having arbitrary directivity patterns, given by the O

arbitrary directivity patterns, given by the O -format source functions - format source functions {r {r

_w_w

, r , r

_x_x

, r , r

_y_y

, r , r

_z_z

}, and the B- }, and the B -format receiver functions {r format receiver functions {r

_w_w

, r , r

_x_x

, r , r

_y_y

, r , r

_z_z

}: }:

{ } { } { } ^r ^y ^r [ ] ^h { } ^s

t _sr = ⊗ = ⊗ ⊗

In which also each of {s} and {r} are sets of 4 impulse responses, In which also each of {s} and {r} are sets of 4 impulse response s, representing the frequency

representing the frequency- -dependent directivities of the source dependent directivities of the source and of the receiver

and of the receiver

(44)

Limits of the 1

Limits of the 1 ^st ^st - - order method order method

Albeit mathematically elegant and easy to implement with currentlyAlbeit mathematically elegant and easy to implement with currently--existing hardware, existing hardware, the 1

the 1^st^st-order method presented here cannot represent faithfully the comp-order method presented here cannot represent faithfully the complex directivity lex directivity pattern of an human voice or of an human ear:

pattern of an human voice or of an human ear:

-20 -15 -10 -5 0 5

0

15

30 45

60

75

90

105

120

135

150 165

180 195

210 225 240 255 270

285 300

315 330

345

Real

1st order approx.

W X Y

1.421 0.289 0.000

(45)

Limits of the 1

Limits of the 1 ^st ^st - - order method order method

The polar pattern of a binaural dummy head is even more complex,The polar pattern of a binaural dummy head is even more complex,as shown as shown here (1 kHz, right ear):

here (1 kHz, right ear):

-20 -15 -10 -5 0

0 5 10 15 20

2530 3540

45 50

55 60

65 70

75 80

85 90 95 100 105 110 115 120 125 130 135 145140 155150 165160 175170 185 180

195190 205200 215210 220 225 230 235 240 245 250 255 260 265 270 275 280

285 290

295 300

305 310

315320325330335340345350355

Measured 1st order approx.

W X Y

0.395 -0.015 0.191

(46)

How to get better spatial resolution?

The answer is simple: analyze the spatial distribution of both source and receiver by The answer is simple: analyze the spatial distribution of both source and receiver by means of higher

means of higher--order spherical harmonics expansionorder spherical harmonics expansion

Spherical harmonics analysis is the equivalent, in space domain,Spherical harmonics analysis is the equivalent, in space domain,of the Fourier of the Fourier analysis in time domain

analysis in time domain

As a complex time-As a complex time-domain waveform can be though as the sum of a number of domain waveform can be though as the sum of a number of sinusoidal and cosinusoidal functions, so a complex spatial dist

sinusoidal and cosinusoidal functions, so a complex spatial distribution around a ribution around a given notional point can be expressed as the sum of a number of

given notional point can be expressed as the sum of a number of spherical harmonic spherical harmonic functions

functions

-0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8

0 0.5 1 1.5 2 2.5 3

dc Cos(5) Sin(5) Cos(10) Sin(10)

-0.8 -0.6 -0.4 -0.2 0 0.2 0.4 0.6 0.8 1

0 0.5 1 1.5 2 2.5 3

Sum

-20 0

0 5 10 1520

2530 35

40 45

50 55

60 65

70 75

80 85 90 95 100 105 110 115 120 125 130 135 140 145 155150 165160 200195

210205 215 220 225 230 235 240 245 250 255 260 265 270 275 280 285 290

295 300

305 310

315 320

325330335340345350 355 0

0.35

0 5 10 1520

2530 35

40 45

50 55

60 65

70 75

80 85 90 95 100 105 110 115 120 125 130 135 140 150145 160155 170165 180 175 190185 200195 210205 215 220 225 230 235 240 245 250 255 260 265 270 275 280 285

290 295

300 305

310 315

320325330335340345350 355

(47)

Higher

Higher - - order spherical harmonics expansion order spherical harmonics expansion

(48)

3 3 ° ° - - order microphone (Trinnov order microphone (Trinnov - - France) France)

Arnoud Laborie developed a 24- Arnoud Laborie developed a 24 -capsule compact microphone capsule compact microphone array

array - - by means of advanced digital filtering, spherical ahrmonic by means of advanced digital filtering, spherical ahrmonic signals up to 3

signals up to 3° ° order are obtained (16 channels) order are obtained (16 channels)

(49)

4 4 ° ° - - order microphone (France Telecom) order microphone (France Telecom)

Jerome Daniel and Sebastien Moreau built samples of 32- Jerome Daniel and Sebastien Moreau built samples of 32 -capsules capsules spherical arrays

spherical arrays - - these allow for extractions of microphone signals these allow for extractions of microphone signals up to 4

up to 4 ° ° order (25 channels) order (25 channels)

(50)

Multichannel software for high

Multichannel software for high- - order order

Plogue Bidule can be used as multichannel host software, runningPlogue Bidule can be used as multichannel host software, runninga number of VST a number of VST plugins developed by France Telecom

plugins developed by France Telecom --these include spherical harmonics extraction from these include spherical harmonics extraction from the spherical microphone arrays, rotation and manipulation of th

the spherical microphone arrays, rotation and manipulation of the multichannel Be multichannel B--format format signals, and final rendering either on head

signals, and final rendering either on head--.tracked headphones or on a static array of .tracked headphones or on a static array of loudspeakers (high

loudspeakers (high--order Ambisonics)order Ambisonics)

(51)

Verification of high

Verification of high - - order patterns order patterns

Sebastien Moreau and Olivier Warusfel verified the directivity pSebastien Moreau and Olivier Warusfel verified the directivity patterns of the atterns of the 44°-°-order microphone array in the anechoic room of IRCAM (Paris)order microphone array in the anechoic room of IRCAM (Paris)

5 kHz 5 kHz

10 kHz 10 kHz

(52)

Frequency extension of the patterns

(53)

High High - - order sound source order sound source

University of California Berkeley's Center for New Music and Audio University of California Berkeley's Center for New Music and Audio Technologies (CNMAT) developed a new 120

Technologies (CNMAT) developed a new 120--loudspeakers, digitally loudspeakers, digitally controlled sound source, capable of synthesizing sound emission controlled sound source, capable of synthesizing sound emission according to spherical harmonics patterns up to 5

according to spherical harmonics patterns up to 5°° order.order.

(54)

Technical details of high

Technical details of high - - order source order source

Class-Class-D embedded amplifiersD embedded amplifiers

Embedded ethernet interface Embedded ethernet interface and DSP processing

and DSP processing

Long-Long-excursion special excursion special Meyer Sound drivers Meyer Sound drivers

(55)

Accuracy of spatial synthesis Accuracy of spatial synthesis

The spatial reconstruction error of a 120-The spatial reconstruction error of a 120-loudspeakers array is frequency loudspeakers array is frequency dependant, as shown here:

dependant, as shown here:

The error is acceptably low over an extended frequency range up to 5The error is acceptably low over an extended frequency range up to 5°°--orderorder

(56)

Advanced digital filtering techniques Advanced digital filtering techniques

A set of digital filters can be employed for sinthesizing the A set of digital filters can be employed for sinthesizing the

required spatial pattern (spherical harmonis), either when deali required spatial pattern (spherical harmonis), either when dealing ng with a microphone array or when dealing with a loudspeaker array with a microphone array or when dealing with a loudspeaker array

Whatever theory or method is chosen, we always start with N Whatever theory or method is chosen, we always start with N input signals x

input signals x

_i_i

, and we derive from them M output signals y , and we derive from them M output signals y

_j_j

And, in any case, each of these M outputs can be expressed by: And, in any case, each of these M outputs can be expressed by:

i N

1 i

ij

j h x

y ∑

=

⊗

=

processor N inputs

M outputs

(57)

Example with a microphone array Example with a microphone array

The sound field is sampled in N points by means of a microphone arrayThe sound field is sampled in N points by means of a microphone array

Is the time-domain sampled waveform of a wave with well defined spatial characteristics, for example:

• a spherical wave centered in a precise emission point P

_source

• a plane wave with a certain direction

• a spherical harmonic referred to a receiver point P

_rec

Pre-calculated filters

i N

1 i

ij

j

h x

y ∑

=

⊗

=

x

₁

(t) x

₂

(t) x

₃

(t) x

₄

(t)

y

_j

(t)

(58)

Traditional design of digital filters Traditional design of digital filters

The processing filters h The processing filters h

_ij_ij

are usually computed are usually computed following one of several, complex mathematical following one of several, complex mathematical

theories, based on the solution of the wave theories, based on the solution of the wave

equation (often under certaing simplifications), equation (often under certaing simplifications),

and assuming that the microphones are ideal and and assuming that the microphones are ideal and

identical identical

In some implementations, the signal of each In some implementations, the signal of each

microphone is processed through a digital filter microphone is processed through a digital filter

for compensating its deviation, at the expense of for compensating its deviation, at the expense of

heavier computational load

(59)

Novel approach Novel approach

No theory is assumed: the set of h No theory is assumed: the set of h

_ij_ij

filters are filters are derived directly from a set of impulse response derived directly from a set of impulse response

measurements, designed according to a least measurements, designed according to a least - -

squares principle.

In practice, a matrix of filtering coefficients, is In practice, a matrix of filtering coefficients, is formed, and the matrix has to be numerically formed, and the matrix has to be numerically

inverted (usually employing some regularization inverted (usually employing some regularization

technique).

This way, the outputs of the microphone array are This way, the outputs of the microphone array are maximally close to the ideal responses prescribed maximally close to the ideal responses prescribed

This method also inherently corrects for transducer This method also inherently corrects for transducer deviations and acoustical artifacts (shielding,

deviations and acoustical artifacts (shielding, diffractions, reflections, etc.)

diffractions, reflections, etc.)

(60)

Example: synthesizing 0

Example: synthesizing 0 - - order shape order shape

The microphone array impulse responses c

_k,i

, are measured for a number of P incoming directions.

…we get the filters to be applied to the microphonic signals from processing the matrix of measured impulse responses

⎥ ⎥

⎦

⎤

⎢ ⎢

⎣

⎡

=

N , P N

, k N

, 2 N

, 1

i , P i

, k i

, 2 i

, 1

2 , P 2

, k 2

, 2 2

, 1

1 , P 1

, k 1

, 2 1

, 1

c c

i=1…N mikes C

k=1…P sources

c

_ki

(61)

Example: synthesizing 0

Example: synthesizing 0 - - order shape order shape

δ

=

⊗

δ

=

⊗

δ

=

⊗

δ

=

⊗

∑

=

N 1 i

i , P 0

, i N

1 i

i , k 0

, i N

1 i

i , 2 0

, i N

1 i

i , 1 0

, i

c h

...

c h

...

c h

Lets call v

_k

the right-hand vector of known results (they will be different for higher-order shapes).

Once this matrix of N inverse filters are computed (for example, employing the Nelson/Kirkeby method), the output of the

microphone array, synthesizing the prescribed 0th-order shape, will again be simply:

∑ =

⊗

= ^N

1 i

0 , i i

0 x h

y

We design the filters in such a way that the response of the system is the prescribed theoretical function v

_k

for the k-th source (an unit- amplitude Dirac’s Delta function in the case of the example, as the 0th-order function is omnidirectional).

So we set up a linear equation system of P equations, imposing

that:

(62)

System

System ’ ’ s least s least - - squares inversion squares inversion

For computing the matrix of N filtering coefficients hFor computing the matrix of N filtering coefficients h_i0_i0, a least-, a least-squares squares method is employed.

method is employed.

A “A “total squared errortotal squared error”” εε_tot_tot is defined as:is defined as:

( )

∑ ∑ = = ⎥ ⎥

⎦

⎤

⎢ ⎢

⎣

⎡ ⊗ −

=

ε ^P

1 k

2 k N

1 i

ki 0

i

tot h c v

A set of N linear equations is formed by minimising A set of N linear equations is formed by minimising ε ε

University of Parma

Angelo Farina Angelo Farina

Industrial Engineering Dept.

Industrial Engineering Dept.