There has been increasing attention in the literature to wearable acoustic recording devices, particularly to examine naturalistic speech in disordered and child populations. Recordings are typically analyzed using automatic procedures that critically depend on the reliability of the collected signal. This work describes the acoustic amplitude response characteristics and the possibility of acoustic transmission loss using several shirts designed for wearable recorders. No difference was observed between the response characteristics of different shirt types or between shirts and the bare-microphone condition. Results are relevant for research, clinical, educational, and home applications in both practical and theoretical terms.

Use of wearable acoustic recording technology has been steadily increasing in research, clinical, educational, and home settings.1–8 Modern technology is capable of capturing day-long audio samples in naturalistic, family environments from the auditory perspective of a user wearing the recording device. Critically, the naturalistic recording is analyzed (typically offline) using automatic speech processing (ASP) and/or automatic speech recognition (ASR) software to generate variables of interest to the user. One such system used to analyze children's speech and language characteristics is produced by the LENA Research Foundation (LRF; Boulder, CO). It consists of a wearable acoustic recording device and custom ASP software. The recorder is designed to be worn in a custom-fit pocket on the front of a shirt worn by a child. The system records and stores the full-day of audio for further analysis or playback. The goal of the system is to provide objective estimates of variables that are known to influence speech, language, and social development. Studies using this device have examined total word and conversational interaction exposures with adults,1,2,8 electronic media exposure,2,9 and the system has been demonstrated to be useful in the assessment of children with Autism spectrum disorder,3,4 language delay,3 hearing loss,5,9 and with preterm infants.7 

Although wearable recording devices have been used extensively in a variety of settings, there are gaps in the scientific knowledge of the practical characteristics and interpretability of the audio recordings due to the way the recordings were obtained. In particular, acoustic transfer characteristics of the clothing used to collect the raw recordings is not known. Acoustic transmission loss (TL) has been examined generally for fabric positioned in front of a noise source10–12 and in front of a voice,13,14 but not for wearable recording systems such as the LENA. In general, denser material composition (i.e., weave density) and indirect microphone placement increase amplitude loss at frequencies greater than 1 kHz. Woven cotton has been shown not to influence TL up to 2.5 kHz, while other materials such as paper surgical masks have been shown to have increased TL above about 250 Hz.14 Other factors directly contributing to TL include the relationship between the noise or voice source and the polar response pattern of the microphone and signal interference from external noise or environmental sources.

There are no known reports of TL of a wearable recording device in situ. A better understanding of the acoustic properties of the transmission medium will increase our ability to effectively use and manage the signals collected from these devices. This report describes acoustic transfer characteristics of a bare-microphone compared with two types of clothing used with a specific, commonly-used recording system.

The LENA is a commercially available, body-worn audio recording system. The system includes a recorder weighing about 70 g and measuring about 1 × 5 × 8 cm designed to be a self-contained acoustic recording and storage device worn by a child for a whole day. The system requires peripheral clothing with a pocket to render the recorder wearable. One of the primary goals of securing the recorder inside the chest pocket is to fix the recorder's integrated microphone at a specific distance (10–15 cm) from the wearer's mouth. The LRF-designed standard clothing (SC) features a custom pocket manufactured into the product. Custom clothing (CC) reproductions were fabricated by the author based on the design patterns provided by the LRF. Both shirt types were comprised of plain cotton or cotton-blended fabric featuring a chest pocket internally backed with padded material. Three edges of the pockets were permanently sewn, and one edge had several snaps to secure the recorder in place. The LENA system used in the present experiments is designed so that the recorder remains snugly in place within the pocket, fixing its position inside the pocket and with respect to the wearer's body and mouth. The pockets were designed to be unobtrusive for the child wearing the shirt. Individual SC and CC shirts differed in color, size, and/or wear-and-tear, and CC shirts differed additionally in blend of fabric materials and/or stitching patterns. Overall, the selected shirts were soft and unadorned (e.g., without rhinestones or buttons) without extensive screen-printing or logos.

A total of 126 stimuli were constructed at each of nine frequencies [modulated (5%) pure-tones at 64, 125, 250, 500, 1000, 2000, 4000, 8000, and 16 000 Hz] and 14 amplitude levels [5 dB steps from 25–90 dB sound pressure level (SPL)]. Stimuli were presented from a near-field monitor loudspeaker (model 8030A, Genelec, Iisalmi, Finland). A calibrated measurement microphone (model XL2, NTi Audio, Schaan, Liechtenstein) was used to collect amplitude measurements in a bare baseline condition and inside the pocket of the clothing. No person occupied the room during testing. All measurements were collected inside a calibrated IAC sound-attenuated booth in the Speech and Language Lab at Washington State University.

For all recordings, the microphone was mounted on a boom stand and fitted with a foam windscreen to ensure consistent placement and no direct contact for the conditions inside the pocket. For all data collection, the head of the microphone was positioned about one-half inch from an acoustically-treated easel in the testing booth. The angle of the microphone and positions of all items in the recording room remained fixed throughout testing.

In the baseline condition, the microphone was bare with no object between it and the acoustic source. The purpose of the baseline condition was to establish the experimental setup so that the presentation levels of each stimulus were controlled. In the within pocket condition, each shirt was tacked to the easel simulating normal wear, and the microphone was placed inside the front pocket. The position of the measurement microphone into the pocket simulated the microphone position of the recorder in real world applications. Using this procedure, baseline response values were directly comparable to response values collected in the test conditions with the microphone inside the shirt pockets, thus allowing for direct assessment of the influence, if any, of the shirt fabric on the transmission of the acoustic signal.

Ten representative examples of each SC and CC clothing type were tested. Shirts were in regular-use rotation and were freshly laundered. All 126 stimuli were presented to each of 20 shirts. In total, 2520 amplitude observations were recorded. Within group comparisons used a one-sample t-test, and between-group comparisons used a independent-sample t-test. All statistical tests were two-tailed.

There were no within-group differences for either the SC or CC clothing types for any of the 126 stimuli [t(9) < 0.5, p > 0.2] or for observations pooled within each presentation level compared across each frequency [t(9) < 0.5, p > 0.3]. These results indicate similar response characteristics among the exemplars within each shirt type. There were no between-group differences for SC-CC, SC-baseline, or CC-baseline comparisons at individual stimuli or pooled by frequency [t(9) < 0.5, p > 0.2], indicating similar response characteristics between exemplars and the known baseline signal characteristics. Mean response amplitude values at each frequency and stimulus presentation value are displayed in Fig. 1, showing amplitude on the ordinate and log-scaled frequency on the abscissa. The stimuli reference values are shown by the dashed line, responses in the SC condition in circle markers, and responses in the CC condition in diamond markers. Table 1 shows response characteristics for each shirt type and the bare-microphone reference value. The difference between SC and CC shirt types at each of the 14 stimulus frequencies was not statistically significant [t(13) = 1.13, p > 0.2].

Fig. 1.

Amplitude response characteristics inside SC (circle marker heavy line), and inside CC (diamond marker, light line). The stimulus is given by the dashed line.

Fig. 1.

Amplitude response characteristics inside SC (circle marker heavy line), and inside CC (diamond marker, light line). The stimulus is given by the dashed line.

Close modal
Table 1.

Mean response amplitude of SC and CC shirt types with reference to bare-microphone stimulus value for reference. The difference between SC and CC shirt types is also given.

Stimulus amplitude (dB SPL)
25.030.035.040.045.050.055.060.065.070.075.080.085.090.0
SC shirt 24.3 29.0 34.2 39.1 44.3 49.1 54.3 59.2 64.3 69.1 74.2 79.2 84.3 89.4 
CC shirt 24.2 28.9 34.3 38.9 43.9 49.1 54.4 59.0 64.0 69.0 74.3 79.7 84.3 89.2 
SC – CC 0.1 .0.1 −0.1 0.2 0.4 0.0 −0.1 0.2 0.3 0.1 −0.1 −0.5 0.0 0.2 
Stimulus amplitude (dB SPL)
25.030.035.040.045.050.055.060.065.070.075.080.085.090.0
SC shirt 24.3 29.0 34.2 39.1 44.3 49.1 54.3 59.2 64.3 69.1 74.2 79.2 84.3 89.4 
CC shirt 24.2 28.9 34.3 38.9 43.9 49.1 54.4 59.0 64.0 69.0 74.3 79.7 84.3 89.2 
SC – CC 0.1 .0.1 −0.1 0.2 0.4 0.0 −0.1 0.2 0.3 0.1 −0.1 −0.5 0.0 0.2 

This report compares the amplitude response characteristics of a microphone placed in a bare condition and inside the fabric pockets of two shirt types commonly used for wearable audio recorders. Test stimuli comprised a range of amplitudes and frequencies important for speech production and perception. Reliable differences were not observed at any test frequency or level, within exemplars of either type of clothing, or among the shirt types and the bare microphone conditions. These findings suggest that the acoustic influence of the shirt materials tested here may have a negligible impact on recordings collected with wearable technology.

The results reported here are evidence that the acoustic recordings may be robust within the limits of the clothing design, and users may benefit from expanded options or applications for use with a recording device inside CC or with a bare-microphone. The possibility of expanding clothing options for children and adults with special needs, families with social, religious, or clinical restrictions, or in educational or clinical settings may be warranted. In addition, this result may reduce the overall resource expense of both time and financial burden to researchers, clinicians, educators, and parents if special clothing or bare-microphone conditions constitute reliable, less expensive alternatives.

A better understanding of the acoustic signal collected from a wearable recorder is useful to a variety of applications including ASP, ASR, and signal processing used in applications such as hearing aids and telephony. For these applied problems, the response characteristics of the system are important because acoustic details of the recording are (typically) the only input to the system. Current wearable recording systems such as the LENA technology have been shown to be useful for a variety of research questions,2–5,7–9,15 and there has been increasing interest in assessing details of the acoustic recordings.3,6,15,16 An understanding of the characteristics of the collected signal may influence the nature of future inquiry using this or a similar application.

This work has several limitations. First, testing was conducted in the controlled laboratory environment with highly controlled stimuli and simulated-use environments. Ecological validity and a greater diversity of testing materials is needed, including comparing more varied fabrics and testing with a variety of users under various real-world situations. Second, statistical tests here showed non-significant effects which are difficult to directly interpret. Third, although statistical differences among acoustic response characteristics were not found, specific ASP or ASR algorithms may be sensitive to the acoustic noise or TL.

With increasing interest in wearable, passive technology in a wide variety of clinical, educational, and home settings, a better understanding of response characteristics is crucial to continued and extended applications. The work presented here will benefit researchers, clinicians, educators, and caregivers using wearable acoustic recording devices in a variety of contexts and may allow researchers to expand investigations in new directions in both theoretical and applied contexts.

1.
J.
Gilkerson
and
J. A.
Richards
, “
The LENA Natural Language Study, Technical Report LTR-02-2
” (LENA Research Foundation,
2008
), http://www.lenafoundation.org/TechReport.aspx/Natural_Language_Study/LTR-02-2 (Last viewed September 6, 2009).
2.
D. A.
Christakis
,
J.
Gilkerson
,
J. A.
Richards
,
F. J.
Zimmerman
,
M. M.
Garrison
,
D.
Xu
,
S.
Gray
, and
U.
Yapanel
, “
Audible television and decreased adult words, infant vocalizations, and conversational turns: A population-based study
,”
Arch. Pediatr. Adolesc. Med.
163
(
6
),
554
558
(
2009
).
3.
D. K.
Oller
,
P.
Niyogi
,
S.
Gray
,
J. A.
Richards
,
J.
Gilkerson
,
D.
Xu
,
U.
Yapanel
, and
S. F.
Warren
, “
Automated vocal analysis of naturalistic recordings from children with autism, language delay, and typical development
,”
Proc. Natl. Acad. Sci. U.S.A.
107
(
30
),
13354
13359
(
2010
).
4.
S. F.
Warren
,
J.
Gilkerson
,
J. A.
Richards
,
D. K.
Oller
,
D.
Xu
,
U.
Yapanel
, and
S.
Gray
, “
What automated vocal analysis reveals about the vocal production and language learning environment of young children with Autism
,”
J. Autism Dev. Disord.
40
,
555
569
(
2010
).
5.
M.
VanDam
,
S. E.
Ambrose
, and
M. P.
Moeller
, “
Quantity of parental language in the home environments of hard-of-hearing 2-year-olds
,”
J. Deaf Stud. Deaf Educ.
17
(
4
),
402
420
(
2012
).
6.
A.
Sangwan
,
A.
Ziaei
, and
J. H. L.
Hansen
, “
ProLifeLog: Environmental analysis of keyword recognition for naturalistic daily audio streams
,” poster presented at ICASSP 2012 (Kyoto, Japan,
2012
).
7.
M.
Caskey
,
B.
Stephens
,
R.
Tucker
, and
B.
Vohr
, “
Importance of parent talk on the development of preterm infant vocalizations
,”
Pediatrics
128
(
5
),
910
916
(
2011
).
8.
F. J.
Zimmerman
,
J.
Gilkerson
,
J. A.
Richards
,
D. A.
Christakis
,
D.
Xu
,
S.
Gray
, and
U.
Yapanel
, “
Teaching by listening: The importance of adult-child conversations to language development
,”
Pediatrics
124
(
1
),
342
349
(
2009
).
9.
S. E.
Ambrose
,
M.
VanDam
, and
M. P.
Moeller
, “
Linguistic input, electronic media, and communication outcomes in toddlers with hearing loss
,”
Ear Hear.
35
(
2
),
139
147
(
2014
).
10.
M. E.
Nute
and
K.
Slater
, “
The effect of fabric parameters on sound transmission loss
,”
J. Textile Inst.
64
(
11
),
652
658
(
1973
).
11.
M. E.
Nute
and
K.
Slater
, “
A rapid method for studying acoustic behavior
,”
J. Acoust. Soc. Am.
54
(
6
),
1747
1749
(
1973
).
12.
S.
Aso
and
R.
Kinoshita
, “
Absorption of sound wave by fabrics, part I: Absorption mechanisms
,”
J. Textile Machinery Soc. Japan (English edition)
9
,
1
15
(
1963
).
13.
C.
Llamas
,
P.
Harrison
,
D.
Donnelly
, and
D.
Watt
, “
Effects of different types of face coverings on speech acoustics and intelligibility
,”
York Papers Ling.
9
(
2
),
80
104
(
2009
).
14.
V. B.
Biesel
and
K. A.
Cunefare
, “
Insertion loss of personal protective clothing
,”
Appl. Acoust.
62
(
7
),
831
840
(
2001
).
15.
M.
VanDam
, “
Automatic acoustic-phonetic analyses of thousands of hours of conversational speech in hard of hearing children
,” Poster presented at the 161st Meeting of the Acoustical Society of America (Seattle, WA,
2011
).
16.
M.
VanDam
and
N. H.
Silbert
, “
Precision and error of automatic speech recognition
,”
Proc. Meet. Acoust.
19
,
060006
(
2013
).