• Non ci sono risultati.

Measurement and 5.0 rendering of spatial impulse responses of rooms

N/A
N/A
Protected

Academic year: 2021

Condividi "Measurement and 5.0 rendering of spatial impulse responses of rooms"

Copied!
58
0
0

Testo completo

(1)

116

th

AES Convention Berlin, 8-11 May 2004

Angelo Farina

Dipartimento di Ingegneria Industriale, Università di Parma, Via delle Scienze 181/A Parma, 43100 ITALY

HTTP://pcfarina.eng.unipr.it - mail: angelo.farina@unipr.it

Measurement and 5.0 rendering

of spatial impulse responses of rooms

(2)

116th AES Convention Berlin, 8-11 May 2004

Topics

• This presentation is a tribute to M. Gerzon, who had foreseen 3D impulse response measurements and 3D Auralization obtained by convolution.

• Comparison between Auralizations based on calculated and measured IRs

(e.g. Theatre “La Fenice”, Venice)

• The advantages (and disadavantages) of employing measured IRs

• Possible approaches to Auralization over ITU 5.0

“surround” systems

(3)

116th AES Convention Berlin, 8-11 May 2004

Concept (Gerzon, 1975)

Portable PC with 4- channels sound board Original Room

Sound Source

SoundField Microphone

B-format 4-channels signal (WXYZ) Measurement of B-format

Impulse Responses

MLS or sweep excitation signal

Convolution of dry signals with the B-format Impulse

Responses Sound Source

Mono Mic.

B-format Imp. Resp.

of the original room

B-format 4-channels signal (WXYZ) Convolver

Ambisonics decoder

Speaker array in the reproduction room

(4)

116th AES Convention Berlin, 8-11 May 2004

Why do we measure these IRs?

1. In case something happens to the original space (e.g.: La Fenice theater) they contain a detailed “acoustical photography” which is preserved for the posterity

2. They can be used for studio sound

processing, as artificial reverb and surround filters for today’s (5.1) and tomorrow’s

musical productions

3. Auralization in special listening rooms can

be performed for subjective tests

(5)

116th AES Convention Berlin, 8-11 May 2004

Theatre la Fenice, Venice

The first theatre was realised in 1792 by Gian Antonio Selva, after the burning of Teatro San Benedetto

In December 1836 the theatre burned down again and was rebuilt by G. and T.

Meduna the year after

The theatre was closed in 1995 for maintainance; it had to open again in February 1, 1996, but it burned two days before (January 29, 1996)

A few weeks before the fire, Tronchin measured binaural impulse responses

(6)

116th AES Convention Berlin, 8-11 May 2004

Impulse Responses of La Fenice

In 27 positions a series of binaural impulse responses (with gun shots) was recorded

Each recording is consequently a

stereo file at 16 bits, 48 kHz

During

measurements the room was perfectly fitted, whilst the stage was empty (no scenery)

Point n. 12

(7)

116th AES Convention Berlin, 8-11 May 2004

Numerical simulation vs. measurement

Prelude 1° act “La Traviata”

by G.Verdi

Dry music

Convolution with

experimental I.R. (pt.

12)

Convolution with simulated IR

Stop

(8)

116th AES Convention Berlin, 8-11 May 2004

Advanced IR capture and rendering ( project)

• Description of the measurement technique

• Analysis of some acoustical parameters of some theaters measured

• Description of the processing methods to be employed for transforming the measured data in audible reconstructions of the original spaces

• Description of the usage of the measured data

for studio processing, musical production and

for scientific Auralization tests

(9)

116th AES Convention Berlin, 8-11 May 2004

Sound propagation in rooms

Direct Sound

Reflected Sound

Receiver Direct Sound

Reflected Sound

Point Source

(10)

116th AES Convention Berlin, 8-11 May 2004

Measurement process

The desidered result is the linear impulse response of the acoustic propagation h(t). It can be recovered by knowing the test signal x(t) and the measured system output y(t). It is necessary to exclude the effect of the not-linear part K and of the background noise n(t).

Not-linear, time variant

system K[x(t)]

Noise n(t)

input x(t)

+ output y(t) linear system

w(t)h(t) distorted signal

w(t)

(11)

116th AES Convention Berlin, 8-11 May 2004

Test signal: Log Sine Sweep

x(t) is a sine signal, which frequency is varied

exponentially with time, starting at f

1

and ending at f

2

.

 

 

 

 



 



 

 

 

  

 

 

1 e

f ln f

T f

sin 2 )

t (

x

1

2

f ln f T

t

1

2

1

(12)

116th AES Convention Berlin, 8-11 May 2004

Test Signal – x(t)

(13)

116th AES Convention Berlin, 8-11 May 2004

Measured signal - y(t)

The not-linear behaviour of the loudspeaker causes many harmonics to appear

(14)

116th AES Convention Berlin, 8-11 May 2004

Deconvolution of Log Sine Sweep

The “time reversal mirror” technique is emplyed: the

system’s impulse response is obtained by convolving the measured signal y(t) with the time-reversal of the test signal x(-t). As the log sine sweep does not have a

“white” spectrum, proper equalization is required

Test Signal x(t) Inverse Filter z(t)

(15)

116th AES Convention Berlin, 8-11 May 2004

Inverse Filter – z(t)

The deconvolution of the IR is obtained convolving the measured signal y(t) with the inverse filter z(t)

(16)

116th AES Convention Berlin, 8-11 May 2004

Result of the deconvolution

The last impulse response is the linear one, the preceding are the harmonics distortion products of various orders

2° 1°

5° 3°

(17)

116th AES Convention Berlin, 8-11 May 2004

Maximum Lenght Sequence vs. Sweep

(18)

116th AES Convention Berlin, 8-11 May 2004

Measurement Setup

 The measurement method incorporates all the known techniques:

Binaural

B-format (1

st

order Ambisonics)

WFS (Wave Field Synthesis, circular array)

ITU 5.1 surround (Williams MMA, OCT, INA, etc.) Binaural Room Scanning

M. Poletti high-order virtual microphones

 Any multichannel auralization systems

nowadays available is supported

(19)

116th AES Convention Berlin, 8-11 May 2004

Transducers (sound source #1)

• Equalized, omnidirectional sound source:

Dodechaedron for mid-high frequencies One-way Subwoofer (<120 Hz)

Dodech. LookLine D200

60.0 70.0 80.0 90.0 100.0 110.0 120.0

25 31.5 40 50 63 80 100 125 160 200 250 315 400 500 630 800 1000 1250 1600 2000 2500 3150 4000 5000 6300 8000 10000

Frequency (Hz)

Sound Power Level (dB)

Unequalized Equalized

Lw,tot = 94.8 dB Lw,tot = 106.9 dB

(20)

116th AES Convention Berlin, 8-11 May 2004

Transducers (sound source #2)

• Genelec S30D reference studio monitor:

Three-ways, active multi-amped, AES/EBU Frequency range 37 Hz – 44 kHz (+/- 3 dB)

Genelec S30D

60.0 70.0 80.0 90.0 100.0 110.0 120.0

25 31.5 40 50 63 80 100 125 160 200 250 315 400 500 630 800 1000 1250 1600 2000 2500 3150 4000 5000 6300 8000 10000

Frequency (Hz)

Sound Power Level (dB)

Power Pressure Lw,tot = 96.8 dB

(21)

116th AES Convention Berlin, 8-11 May 2004

Directivity of transducers

- 40 - 35 - 30 - 25 - 20 - 15 - 10 - 5 0 0

30

60

90

120

150 180

210 240 270

300 330

1000 Hz

-40 -35 -30 -25 -20 -15 -10 -5 0 0

30

60

90

120

150 180

210 240 270

300 330

2000 Hz

LookLine D200 dodechaedron

- 40 - 35 - 30 - 25 - 20 - 15 - 10 - 5 0 0

30

60

90

120

150 180

210 240 270

300 330

250 Hz

- 40 - 35 - 30 - 25 - 20 - 15 - 10 - 5 0 0

30

60

90

120

150 180

210 240 270

300 330

4000 Hz

- 40 - 35 - 30 - 25 - 20 - 15 - 10 - 5 0 0

30

60

90

120

150 180

210 240 270

300 330

8000 Hz

- 40 - 35 - 30 - 25 - 20 - 15 - 10 - 5 0 0

30

60

90

120

150 180

210 240 270

300 330

16000 Hz

(22)

116th AES Convention Berlin, 8-11 May 2004

Directivity of transducers

1000 Hz

-40 -35 -30 -25 -20 -15 -10 -5 0

0

30

60

90

120

150

180 210

240 270

300 330

250 Hz

-40 -35 -30 -25 -20 -15 -10 -5 0

0

30

60

90

120

150

180 210

240 270

300 330

2000 Hz

-40 -35 -30 -25 -20 -15 -10 -5 0

0

30

60

90

120

150

180 210

240 270

300 330

4000 Hz

-40 -35 -30 -25 -20 -15 -10 -5 0

0

30

60

90

120

150

180 210

240 270

300 330

8000 Hz

-40 -35 -30 -25 -20 -15 -10 -5 0

0

30

60

90

120

150

180 210

240 270

300 330

16000 Hz

-40 -35 -30 -25 -20 -15 -10 -5 0

0

30

60

90

120

150

180 210

240 270

300 330

Genelec S30D reference studio monitor

(23)

116th AES Convention Berlin, 8-11 May 2004

Transducers (microphones)

• 3 types of microphones:

Binaural dummy head (Neumann KU-100) 2 Cardioids in ORTF placement (Neumann K- 140)

B-Format 4 channels (Soundfield ST-250)

Turntable

Binaural dummy head Cardioids (ORTF) Soundfield Microphone

(24)

116th AES Convention Berlin, 8-11 May 2004

Directivity of transducers

Soundfield ST-250 microphone

0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6

0

30

60

90

120

150

180 210

240 270

300 330

125 Hz

0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6

0

30

60

90

120

150

180 210

240 270

300 330

250 Hz

0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6

0

30

60

90

120

150

180 210

240 270

300 330

500 Hz

0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6

0

30

60

90

120

150

180 210

240 270

300 330

1000 Hz

0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6

0

30

60

90

120

150

180 210

240 270

300 330

2000 Hz

0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6

0

30

60

90

120

150

180 210

240 270

300 330

4000 Hz

0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6

0

30

60

90

120

150

180 210

240 270

300 330

8000 Hz

0 0.2 0.4 0.6 0.8 1 1.2 1.4 1.6

0

30

60

90

120

150 180

210 240 270

300 330

16000 Hz

(25)

116th AES Convention Berlin, 8-11 May 2004

Directivity of transducers

Neumann K-140 (ORTF Cardioids)

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

0 10 20

30 40

50 60

70 80

90 100 110 120 130 140 160150 180 170 200190 210 220 230 240 250 260 270 280 290

300 310

320330340350

125 Hz

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

0 10 20

30 40

50 60

70 80 90 100 110 120 130 140 160150 180 170 200190 210 220 230 240 250 260 270 280 290

300 310

320330340350

250 Hz

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

0 10 20

30 40

50 60

70 80

90 100 110 120 130 140 160150 180 170 200190 210 220 230 240 250 260 270 280 290

300 310

320330340350

500 Hz

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

0 10 20

30 40

50 60

70 80 90 100 110 120 130 140 160150 180 170 200190 210 220 230 240 250 260 270 280 290

300 310

320330340350

1000 Hz

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

0 10 20

30 40

50 60

70 80 90 100 110 120 130 140 160150 180 170 200190 210 220 230 240 250 260 270 280 290

300 310

320330340350

2000 Hz

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

0 10 20

30 40

50 60

70 80 90 100 110 120 130 140 160150 180 170 200190 210 220 230 240 250 260 270 280 290

300 310

320330340350

4000 Hz

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

0 10 20 30

40 50

60 70

80 90 100 110 120 130 140 160150 180 170 200190 210 220 230 240 250 260 270 280 290

300 310

320

330340350

8000 Hz

0 0.1 0.2 0.3 0.4 0.5 0.6 0.7 0.8 0.9 1

0 10 20 30

40 50

60 70

80 90 100 110 120 130 140 160150 180 170 200190 210 220 230 240 250 260 270 280 290

300 310

320

330340350

16000 Hz

(26)

116th AES Convention Berlin, 8-11 May 2004

Other hardware equipment

• Rotating Table:

- Outline ET-1

Computer and sound card

:

Signum Data Futureclient P-IV 1.8 GHz

Aardvark Pro Q-10 (8 ch., 96 kHz, 24 bits)

(27)

116th AES Convention Berlin, 8-11 May 2004

Measurement procedure

A single measurement session play backs 36 times the test signal, and simultaneusly record the 8 microphonic channels

(28)

116th AES Convention Berlin, 8-11 May 2004

Theatres measured

Reverberation Time T20

0 0.5 1 1.5 2 2.5 3 3.5 4 4.5 5

31.5 63 125 250 500 1000 2000 4000 8000 16000

Frequency (Hz)

T20 (s)

Uhara

Noh

Kirishima

Siracusa

Taormina

Audit. Parma

Roma-700

Roma-1200

Roma-2700

Bergamo Cathedral Valli-RE

SOH Concert Hall

SOH-Opera Theatre SOH-The Studio Regio Parma

(29)

116th AES Convention Berlin, 8-11 May 2004

Uhara Hall, Kobe, Japan

T 20 = 1 .4 4 s

(30)

116th AES Convention Berlin, 8-11 May 2004

Noh theater, Kobe, Japan

T 20 = 1 .1 4 s

(31)

116th AES Convention Berlin, 8-11 May 2004

Kirishima Concert Hall, Japan

(32)

116th AES Convention Berlin, 8-11 May 2004

Kirishima Concert Hall, Japan

T 20 = 1 .9 3 s

(33)

116th AES Convention Berlin, 8-11 May 2004

Greek Theater in Siracusa

T 20 = 0 .6 5 s

(34)

116th AES Convention Berlin, 8-11 May 2004

Roman Theater in Taormina

T 20 = 1 .1 5 s

(35)

116th AES Convention Berlin, 8-11 May 2004

Parma Auditorium, Italy

T 20 = 2 .0 8 s

(36)

116th AES Convention Berlin, 8-11 May 2004

Rome Auditorium, 700 seats

T 20 = 2 .0 4 s

(37)

116th AES Convention Berlin, 8-11 May 2004

Rome Auditorium, 1200 seats

T 20 = 2 .1 0 s

(38)

116th AES Convention Berlin, 8-11 May 2004

Rome Auditorium, 2700 seats

T 20 = 2 .5 6 s

(39)

116th AES Convention Berlin, 8-11 May 2004

Bergamo’s Cathedral, Italy

T 20 = 2 .9 5 s

(40)

116th AES Convention Berlin, 8-11 May 2004

Teatro Valli, Reggio Emilia, Italy

T 20 = 1 .5 5 s

(41)

116th AES Convention Berlin, 8-11 May 2004

Sydney Opera House

(42)

116th AES Convention Berlin, 8-11 May 2004

Sydney Opera House – opera theatre

T 20 = 1 .1 6 s

(43)

116th AES Convention Berlin, 8-11 May 2004

Sydney Opera House – concert hall

T 20 = 2 .0 4 s

(44)

116th AES Convention Berlin, 8-11 May 2004

Sydney Opera House – the studio

T 20 = 0 .7 6 s

(45)

116th AES Convention Berlin, 8-11 May 2004

Teatro Regio in Parma (Italy)

T 20 = 1 .1 1 s

(46)

116th AES Convention Berlin, 8-11 May 2004

Auralization by convolution

• The basic method consists in convolution of a dry signal with a set of impulse responses corresponding to the required output format for surround (2 to 24 channels).

• The convolution operation can

nowadays be implemented very

efficiently on a modern PC through an

ancient algorithm (equally-partitioned

FFT processing, Stockam 1966).

(47)

116th AES Convention Berlin, 8-11 May 2004

Auralization types

• Stereo (ORTF on 2 standard loudspeakers at +/- 30°)

• Rotation-tracking reproduction on headphones (Binaural Room Scanning)

• Stereo Dipole (cross-talk cancellation)

• Full 3D Ambisonics 1

st

order (decoding the B-format signal)

• ITU 5.1 “surround sound” systems

• 2D Ambisonics 3

rd

order (from Mark Poletti’s circular array microphone)

• Wave Field Synthesis (from the circular array of Soundfield microphones)

• Hybrid methods (Ambiophonics)

(48)

116th AES Convention Berlin, 8-11 May 2004

Approach # 1 – separate rendering for each recorded track (source)

Each of the dry recordings represents a source in a different position, so it must be separately convolved with its own set of impulse responses

M I X

5 ch.

1x5 1x5

1 ch.

1 ch.

5 ch.

5 ch.

(49)

116th AES Convention Berlin, 8-11 May 2004

Approach # 2 – the recorded tracks are first panned to 5.0, then the “room” is added

The “room effect” is a global filtering

applied to a 5.0 “dry mix” of several tracks

M I X

5 ch.

5 ch.

5 ch.

5 (1x1)

(50)

116th AES Convention Berlin, 8-11 May 2004

Full Auralization vs. Reverb

• In “full auralization” also the direct sound comes from the measured IRs

• In “reverb” the dry signals are employed in place of

the direct sound, and the measured IRs are employed

only for adding subsequent reverberant tail

(51)

116th AES Convention Berlin, 8-11 May 2004

Software tools

• Gerzonic’s Ambisonics decoders (Emigrator, DecoPro) Waves IR1 convolution reverb David McGriffy’s Visual Virtual Mike (VST & standalone)

(52)

116th AES Convention Berlin, 8-11 May 2004

Approach # 1 – single source rendering – choice of 1x5 filter set

1.

Ambisonics (1° order) from a single B-format impulse response

2.

SIRR according to Ville Pulkki (sound

intensity analysis of a single B-format IR)

3.

5 “virtual mikes” from 5 different B-format impulse responses

4.

5 selected Neumann cardioids

5.

(future) – 5°-order Ambisonics from the whole set of cardioid impulse responses

Tr. 01 Tr. 02

Tr. 03

Tr. 04

Turntable Soundfield Microphone

Turntable Soundfield Microphone

Turntable Soundfield Microphones

Turntable Cardioids (ORTF)

Dry speech Dry music

(53)

116th AES Convention Berlin, 8-11 May 2004

Approach # 2 – the recorded tracks are first panned to 5.0, then the “room” is added

5 ch.

Play track 5

to convolver...

(54)

116th AES Convention Berlin, 8-11 May 2004

Approach # 2 – the recorded tracks are first panned to 5.0, then the “room” is added

Ambisonics IR 5 “virtual mikes”

Dry

Track 6

5 Neumann

cardioids

(55)

116th AES Convention Berlin, 8-11 May 2004

Conclusions

• It is now possible to sample accurately the spatial room impulse response, making it possible to store, analyze and preserve a “3D acoustical photography”

• We are still learning what is the best way to render these sets of impulse responses over a standard 5.0 (or 5.1) setup

• The only point which requires substantial

enhancement: sound sources (loudspeakers)

used for IR measurements

(56)

116th AES Convention Berlin, 8-11 May 2004

Future enhancements

Sound source for realistic

emulation of an human singer

(57)

116th AES Convention Berlin, 8-11 May 2004

Future enhancements

Omnidirectional sound source with

enhanced power & frequency response

Dodech. LookLine D300 + Sub Audio Pro B2.27

60 70 80 90 100 110 120

25 31.5 40 50 63 80 100 125 160 200 250 315 400 500 630 800 1000 1250 1600 2000 2500 3150 4000 5000 6300 8000 10000

Frequency (Hz)

Sound Power Level (dB)

Unequalized Equalized Lw,tot = 100.5 dB

Lw,tot = 108.4 dB

(58)

116th AES Convention Berlin, 8-11 May 2004

Acknowledgements

This research was started thanks to the support of Waves, Tel Aviv, Israel

(www.waves.com)

For years 2004 and 2005 the research is also supported by the Italian Ministry for the University and Research (MIUR)

The following software tools were

provided free: Adobe Audition, Gerzonic

Decopro

Riferimenti

Documenti correlati

So a new numerical technique has been developed, making use of a single omnidirectional microphone: 7 impulse response measurements are made, moving the microphone from the

When spatial information is neglected (i.e., both source and receivers are point and omnidirectional), the whole information about the room’s transfer function is contained in its

It is therefore possible to derive subsequently, by post-processing the measured set of impulse responses, the virtual impulse response produced by a source having arbitrary

In fact, a modern technique was recently developed by Ralph Glasgal [1], called Ambiophonics, which aims to the reproduction of a realistic sound field by simultaneous usage of

Gerzon, who had foreseen 3D impulse response measurements and 3D Auralization obtained by convolution.. • Comparison between Auralizations based on calculated and

ƒ ƒ The The “ “ harmonic harmonic orders orders ” ” impulse impulse responses responses obtained obtained by by the the exponential exponential sine sine sweep sweep

 Dave Malham suggested to represent also the source directivity with a set Dave Malham suggested to represent also the source directivity with a set of spherical harmonics,

Gerzon, who had foreseen 3D impulse response measurements and 3D Auralization obtained by convolution.. • The advantages (and disadavantages) of employing