This paper introduces a ranking and selection approach to psychoacoustic and psychophysical experimentation, with the aim of identifying the top-ranking samples in listening experiments using as few pairwise comparisons as possible. We draw inspiration from sports tournament designs and propose adopting modified knockout (KO) tournaments. Two variants of modified KO tournaments are described, which adapt the tree selection sorting algorithm and the replacement selection algorithm known from computer science. To validate the proposed method, a listening experiment is conducted in which binaural renderings of seven chamber music halls are compared with regard to loudness and reverberance. The rankings obtained by the modified KO tournament method are compared to those obtained from a traditional round-robin (RR) design, in which all possible pairs are compared. Moreover, the paper presents simulations that illustrate the method's robustness under different parameter choices and different assumed underlying data distributions. The results show that modified KO tournaments require fewer comparisons than full RR designs to identify the top-ranking samples, and thus provide a promising alternative for this task. We offer an open-source implementation so that researchers can easily integrate KO designs into their studies.
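
To make the comparison-count argument concrete, the following is a minimal Python sketch of tree selection for the top k samples. It is not the authors' open-source implementation: the function names (`round_robin_comparisons`, `top_k_tree_selection`), the `better` callback, and the toy scores are hypothetical, and the comparator is assumed to be noise-free, which real listener judgments are not.

```python
def round_robin_comparisons(n):
    """Number of pairs in a full round-robin design with n samples."""
    return n * (n - 1) // 2


def top_k_tree_selection(items, k, better):
    """Return (top_k, n_comparisons) using a knockout tree with replay.

    `better(a, b)` is a hypothetical, noise-free judgment that returns
    True when sample `a` should be ranked above sample `b`.
    """
    n = len(items)
    size = 1
    while size < n:          # pad the bracket to a power of two
        size *= 2
    # tree[1] is the root; leaves live at tree[size:]. None marks a bye
    # or an already-selected sample.
    tree = [None] * (2 * size)
    for i in range(n):
        tree[size + i] = i
    count = 0

    def play(a, b):
        nonlocal count
        if a is None:        # a bye costs no comparison
            return b
        if b is None:
            return a
        count += 1
        return a if better(items[a], items[b]) else b

    # Initial knockout: fill internal nodes from the bottom up.
    for node in range(size - 1, 0, -1):
        tree[node] = play(tree[2 * node], tree[2 * node + 1])

    ranking = []
    while tree[1] is not None and len(ranking) < k:
        winner = tree[1]
        ranking.append(items[winner])
        if len(ranking) == k:
            break            # no replay needed after the last pick
        # Remove the winner and replay only the matches on its path.
        node = size + winner
        tree[node] = None
        node //= 2
        while node >= 1:
            tree[node] = play(tree[2 * node], tree[2 * node + 1])
            node //= 2
    return ranking, count


if __name__ == "__main__":
    scores = [3, 1, 7, 5, 2, 6, 4]   # toy scores for seven samples
    top3, used = top_k_tree_selection(scores, 3, lambda a, b: a > b)
    print(top3, used, round_robin_comparisons(len(scores)))
```

For seven samples, a full RR design needs 7 × 6 / 2 = 21 pairs. Under the noise-free assumption, the sketch needs six comparisons to find the winner and at most three more for each further rank, because only the matches along the removed winner's path are replayed.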
