Methodology — TrueNoise

Section 1

Hardware

Microphone

ModelPoP Voice Professional Lavalier omnidirectional condenser
Frequency range20 Hz – 16 kHz
Signal-to-noise ratio80 dB
ConnectionWired (no Bluetooth codec compression or variable latency)
Wind protectionFoam + fur windshield (per ISO 1996 guidance)
Polar patternOmnidirectional

The frequency range fully covers the A-weighted measurement band: A-weighting rolls off by ~50 dB below 20 Hz and ~7 dB above 16 kHz, so this microphone captures everything the A-weighting curve cares about. The omnidirectional pattern is deliberate — aircraft can fly in from any bearing, and equal sensitivity in all directions avoids the orientation bias a cardioid pattern would introduce.

This is a consumer-grade microphone, not an IEC 61672-certified measurement microphone. We address that directly in §6 Calibration and §8 Limitations.

Phone

DeviceApple iPhone with A-series silicon
Audio session mode.measurement (disables all processing — see §2)
CalibrationPer-device offset stored locally and applied to every measurement

Section 2

iOS audio capture

We use Apple's AVAudioEngine framework with the audio session category set to .playAndRecord and mode set to .measurement.

This mode is the critical detail. By default, iOS aggressively processes microphone input for voice calls and video — automatic gain control, noise suppression, voice isolation, and echo cancellation are all applied. None of these are acceptable for an instrument. .measurement mode disables all of them, exposing the raw uncompressed PCM signal exactly as the microphone capsule delivers it.

Tap buffer size1024 frames (~21 ms at 48 kHz)
Sample format32-bit float
Sample rateHardware-native (typically 48 kHz)

Critics sometimes assume "phone" implies "consumer audio pipeline." With .measurement mode, it does not.

Section 3

A-weighting filter

We implement A-weighting per IEC 61672, the international standard for sound level meters.

The filter is constructed as three cascaded biquad sections derived via bilinear transform from the analog s-domain transfer function. The four canonical IEC pole frequencies are used directly:

f₁20.598997 Hz
f₂107.65265 Hz
f₃737.86223 Hz
f₄12194.217 Hz

Section 1 is a 2nd-order high-pass at f₁. Section 2 is a 2nd-order high-pass combining the f₂ and f₃ poles. Section 3 is a 2nd-order low-pass at f₄. The overall response is normalized so the gain at 1 kHz is exactly 0 dB — the IEC reference point.

These are the same pole frequencies used in every certified sound level meter manufactured. A-weighting compensates for the frequency response of the human ear, which is roughly 30 dB less sensitive at 100 Hz than at 1 kHz. Measuring aircraft noise without A-weighting would dramatically overstate low-frequency content that humans don't perceive as loud.

Section 4

FFT and ⅓-octave band analysis

A-weighted SPL is the headline number, but it collapses the entire audible spectrum into a single decibel value. For finer analysis, we compute a ⅓-octave spectrum on every measurement window.

FFT engineApple Accelerate / vDSP — the same SIMD-optimized DSP library Apple uses in Logic Pro and that ships in every professional iOS audio plugin
FFT size4096 samples (~85 ms window at 48 kHz)
Window functionHann window with energy normalization — band levels are calibrated in absolute SPL, not arbitrary spectral units
Frequency resolution~11.7 Hz per bin
Bands28 standard ⅓-octave bands, 25 Hz – 12.5 kHz (ANSI S1.11 / IEC 61260 center frequencies)
Band powerSummed across all FFT bins within each band's bounds (edge factor 2^(1/6))

The A-series chip's vector and neural processing capabilities exceed what real-time spectral analysis requires by orders of magnitude. A 4096-point FFT executes in well under a millisecond. Doubts about "can a phone do FFT accurately" reflect assumptions about pre-2010 phone hardware.

Section 5

Psychoacoustic analysis

A-weighted SPL is a 1936 model of how humans perceive loudness. It is the regulatory standard worldwide, but it is known to be incomplete — particularly for impulsive sounds, tonal components, and the spectral character that makes some noises feel sharper or more annoying than their dBA value would suggest. See The measurement gap for more information.

We compute, on top of A-weighting, three psychoacoustic metrics from the acoustics literature:

Zwicker loudness (sone)Simplified ISO 532 B. Computes specific loudness per Bark critical band, integrates across the Bark scale. Uses ISO 226 threshold-in-quiet values per band.
Loudness level (phon)Derived from total loudness in sones. Numerically equals the dB SPL of a 1 kHz tone perceived as equally loud.
Sharpness (acum)Per DIN 45692. Weights high-frequency specific loudness to capture the perceptual difference between a low rumble and a piercing whine at the same dBA.
Psychoacoustic annoyanceZwicker's combined model synthesizing loudness and sharpness into a single annoyance index.

These metrics matter because two aircraft producing the same dBA can produce very different real-world annoyance — and annoyance is the dose-response variable that WHO uses for sleep disturbance and cardiovascular health endpoints. We provide psychoacoustic analysis in addition to A-weighted SPL, not instead of it.

Empirical finding — loudness/dBA inversion in aft-departure overflights

The TrueNoise dataset contains a finding that directly demonstrates criterion 3 from the methodology intro — that per-event psychoacoustic detail captures what averaged-Leq metrics by design discard. In the aft portion of a departure overflight (aircraft receding, exhaust jet mixing noise dominant), dBA continues to fall while Zwicker loudness in sone temporarily rises or plateaus. The A-weighted level drops because A-weighting suppresses the low-frequency exhaust mixing energy as the aircraft moves away. Perceived loudness rises because the human auditory system — and the Zwicker model — gives more weight to that low-frequency content than A-weighting does.

This is not a measurement artifact. It is a real psychoacoustic phenomenon: the aft-departure sound is perceived as louder than dBA alone would predict. It is also the strongest in-house empirical evidence that a standard relying on dBA alone systematically undercounts the health-relevant acoustic load of jet departures. The C-A delta and sone metrics together capture the inversion; dBA alone cannot see it.

Section 6

Calibration

Every iPhone unit, microphone unit, and microphone connection has slightly different acoustic gain. A measurement system that does not address this is not a measurement system.

We calibrate per device. The app stores a calibration offset (in dB) that converts the device's raw signal level to absolute SPL. Two offset profiles are supported:

Internal mic offset+108 dB. Averaged across multiple calibration sessions on the validation iPhone. NOT yet re-validated against the AZ8930-verified reference chain — internal microphone is not used for primary field measurement; external microphone is the reference for all field sessions.
External mic offset+96 dB (PoP Voice lavalier) — corrected from +99 dB on 1 June 2026; see §7b
Session-to-session variability±3 dB observed — offset is averaged across multiple calibration sessions
Reference instrumentWintact SLM-30B IEC 61672 Class 2 sound level meter

The offset is determined by simultaneous measurement against a reference instrument and is stored locally in the app. This is the same approach used by any professional sound level meter: every unit is individually calibrated, and calibration is checked periodically against a reference.

Windshield insertion loss characterization

Windshields attenuate real acoustic signal in addition to suppressing wind noise — more material in the signal path means more signal loss, particularly at high frequencies where wavelengths approach the windshield's structural dimensions. The fur windshield insertion loss of 1.8 dBA was determined by a delta measurement: simultaneous bare-mic and fur-mic readings of the same signal, with the TA657A as a co-located reference to anchor the comparison.

Configuration	Measured insertion loss	Published industry range	Notes
Bare mic (no windshield)	0.0 dBA	0 dB reference	Reference baseline for delta measurement
Foam windshield	0.6–0.8 dBA	0.5–1.0 dBA	Within published band (Rycote, Brüel & Kjær)
Fur windshield (deadcat)	1.8 dBA	1.5–3.0 dBA	Used for all outdoor monitoring sessions

The 1.8 dBA fur correction is robust to the TA657A drift discovered on 1 June 2026. Because insertion loss is a difference between bare and fur readings on the same signal, any absolute error in the TA657A's reference level cancels in the subtraction. Whatever the meter read high by, both bare and fur readings inherited the same error; the delta between them is unaffected. The fur correction value does not need to be re-derived following the calibration correction.

Methodological note — indoor bench characterization retired. Early windshield testing was conducted indoors with a consumer audio source, with the app mic and SLM-30B co-located near the speaker. Subsequent analysis identified that this setup is subject to near-field positional variability (within approximately λ/2 of a finite source, sound pressure varies substantially with position), speaker frequency response non-linearity, and room boundary reflections — all of which contaminate absolute spectral characterization. The bench setup cannot support claims of multi-spectrum characterization with the rigor implied by "broadband, tonal, impulsive" testing against a reference meter. That framing has been retired from this page. Future windshield characterization will use a co-located outdoor far-field protocol, in which both microphones are placed at identical geometry relative to a distant source, eliminating near-field and reflection artifacts. No date has been set for this work. The 1.8 dBA insertion loss figure is retained as a reasonable historical estimate, within the published industry range of 1.5–3.0 dBA for fur windshields, and robust to the TA657A drift as described above.

All outdoor monitoring sessions are conducted with the fur windshield installed. Published measurements therefore carry a known conservative correction of 1.8 dBA — actual noise levels at the receptor are 1.8 dB higher than reported values. This correction is applied automatically by the app based on the per-session windshield configuration. Researchers requiring the corrected values may apply the 1.8 dBA offset to sessions where windshield configuration is not yet logged.

Multi-geometry validation

The calibrated system has been validated across three independent operating points, spanning two measurement geometries and two signal types:

Scenario	Reference	Signal / geometry	Delta	What it validates
A — Calibrator	AZ8930 (IEC 60942 Class 2) · 94.0 dBA · 1 kHz	Pure tone · bare mic · no windshield · no correction	−0.3 dB	Microphone + +96 dB offset sub-chain only
B — Standardized Receptor	Wintact SLM-30B (IEC 61672 Class 2, AZ8930-verified) · B767 overflight	Broadband aircraft · fur windshield · +1.8 dB correction · free-field	−0.1 dB	Full field chain: mic + offset + windshield correction
C — Façade-Level	Wintact SLM-30B (IEC 61672 Class 2, AZ8930-verified) · 8 aircraft	Broadband aircraft · fur windshield · +1.8 dB correction · façade geometry	−0.78 dB mean range −0.4 to −1.1 dB	Full field chain at façade geometry · 8/8 conservative

The three scenarios are stronger together than any alone. Scenario A confirms the microphone and offset are correctly calibrated against a primary acoustic reference. Scenario B confirms the full field chain on a real aircraft at a standardized free-field position. Scenario C confirms the full field chain at a façade boundary position across eight aircraft spanning four aircraft types — with all eight deltas in the conservative direction. The Façade-Level bias of −0.78 dB relative to Standardized Receptor (−0.1 dB) is attributable to position-dependent reflection interference between non-co-located capsules at the boundary layer, not a calibration offset. Altitude within the SR geometry does not appear to drive the delta — the geometry effect (0.7 dB) and within-SR altitude variation (0.6 dB range) are comparable in magnitude.

The system is calibrated within IEC 61672 Class 2 ±2 dB tolerance and shows agreement substantially tighter than that tolerance across all validated operating points. Post-correction validation spans 11 aircraft across 2 measurement geometries.

Validation chain summary: AZ8930 calibrator (IEC 60942 Class 2, primary reference) → Wintact SLM-30B (IEC 61672 Class 2, −0.0 dB against AZ8930) → TrueNoise app + external mic + +96 dB offset + fur windshield + 1.8 dB correction (−0.3 dB Scenario A · −0.1 dB Scenario B · −0.78 dB mean Scenario C). 11 post-correction aircraft · overall mean Δ ≈ −0.55 dB · range −1.1 to +0.2 dB. The TA657A reference meter has been retired following identification of frequency-dependent drift that could not be corrected by trimming.

Section 6b

Measurement position classification

Where the microphone is placed relative to the acoustic environment determines what the measurement represents. TrueNoise uses a five-category measurement type framework, implemented in the iOS app and serialized in every CSV row. The categories distinguish between measurements designed for policy comparability and measurements that characterize real-world residential exposure as actually experienced.

Standardized Receptor 1.2 m height above ground · 1.5 m from any reflective surface · open-field geometry. This is the WHO/ISO 1996 receptor position used in European noise mapping and epidemiological studies including HYENA and RANCH. Use for: regulatory comparison, cross-location comparability, policy submissions. The TrueNoise calibration validation comparisons against the Wintact SLM-30B were performed at this position.
Community Receptor Any outdoor residential position that does not meet Standardized Receptor geometry — a backyard, a front porch, a deck chair, a child's play area. Represents actual residential exposure as experienced, not a regulated abstraction. Values at Community Receptor positions may differ from Standardized Receptor values at the same location due to reflective surfaces, terrain, and vegetation. Use for: community impact documentation, lived-experience characterization.
Façade-Level Microphone placed at or near an exterior building surface — a window frame, a wall face, a balcony railing. Captures the sound level at the building envelope where transmission into interior spaces begins. Façade measurements are typically 2–6 dB higher than free-field measurements at the same location due to building reflection. Use for: indoor noise intrusion estimation, building envelope characterization.
Field Characterization Exploratory measurement at a non-residential location — a park, a school playground, a community green space, a roadside. Used to characterize acoustic conditions at locations not covered by residential receptor categories. Position documented in the Position Description field.
Hand-held Microphone held by the observer rather than mounted or positioned on a fixed support. Introduces variability from hand position, body reflection, and movement. Results are indicative rather than calibration-grade. Flagged in the dataset; not used for threshold comparison or regulatory claims.

The Measurement Type field is present in every downloaded CSV row. When interpreting the dataset, filter to Standardized Receptor for policy-comparable absolute SPL claims. Community Receptor and Façade-Level measurements document real residential exposure and are appropriate for community impact characterization but should not be directly compared to Standardized Receptor values without noting the positional difference.

Most consumer noise measurement apps record a level without documenting where or how the microphone was placed. The five-category Measurement Type framework — with explicit Standardized vs Community Receptor distinction — is policy-grade measurement discipline that treats geometry as a first-class variable rather than an afterthought.

Section 6c

Wind contamination screening — C-A Delta

A-weighting and C-weighting

Sound level meters apply a frequency-weighting filter to raw acoustic measurements. Two standardized weightings are relevant here, both defined in IEC 61672:

A-weighting (dBA)Approximates how the human ear perceives loudness at typical environmental sound levels. Heavily attenuates frequencies below 500 Hz. Used in nearly all environmental noise regulation — FAA contours, WHO guidelines, EPA standards.
C-weighting (dBC)Much closer to a flat frequency response — includes far more low-frequency content than A-weighting. Used as a diagnostic complement to A-weighting.

Why the C-A delta detects wind

Wind noise on a microphone is not acoustic — it is turbulent air pressure directly on the diaphragm, overwhelmingly low-frequency. A-weighting heavily suppresses energy below 200 Hz; C-weighting does not. A large C-A delta during a loud event is therefore a strong indicator of wind contamination. Thresholds apply only when dBA ≥ 55 — during meaningful aircraft events:

C-A Delta < 15 dBClean — measurement dominated by genuine acoustic content
C-A Delta 15–25 dBPossible wind contamination — flag for review; see aft-aspect note below
C-A Delta ≥ 25 dBStrong wind contamination — exclude from SPL analysis by default

Four documented patterns beyond wind contamination

1. Aft-aspect aircraft noise. When an aircraft is receding, jet exhaust mixing noise dominates — a deep, low-frequency rumble. C-A delta naturally climbs to 15–20 dB in the aft tail. This is real aircraft noise, not contamination. Distinguish from wind by timing (elevated delta appears after dBA peak, while aircraft confirmed receding via ADS-B) and by the presence of elevated sone at the same timestep.

2. Quiet ambient background. Below 55 dBA, the microphone captures the natural low-frequency character of the environment — HVAC, distant traffic, building hum. Large C-A delta at low dBA is ambient bass character, not contamination. The SPL floor prevents this from triggering false exclusions.

3. Atmospheric high-frequency absorption at distance. A fourth cause identified empirically: as an aircraft moves away, the atmosphere preferentially absorbs high-frequency content (ISO 9613-1). This produces a gradually rising C-A delta in track-event rows as slant range increases and dBA falls — a propagation effect, not a source or contamination effect. It should not be treated as grounds for exclusion.

Practical rules

15–20 dB, receding aircraft, calm dayDo not exclude — real aft-aspect aircraft noise
15–20 dB, breezy day, no specific overflightProbably wind contamination — exclude
≥ 25 dB during any loud eventLikely wind — exclude
Large delta, dBA < 55Ambient bass character — not contamination, not excluded

All outdoor sessions use a fur windshield (characterized insertion loss: 1.8 dBA), which suppresses wind noise significantly. The iOS app's Review segment displays C-A delta alongside ADS-B bearing and slant range, giving the observer the context needed to apply these rules correctly. Observations marked excluded in the app are filtered before upload and do not appear in the public dataset.

Glossary

A-weighting (dBA)A frequency filter applied to measurements to match human hearing sensitivity; used in nearly all environmental noise regulations.
C-weighting (dBC)A nearly-flat frequency filter that includes more low-frequency content; used as a diagnostic complement to A-weighting.
C-A Delta (dB)The difference between a C-weighted and A-weighted measurement of the same signal; large values during loud measurements (≥55 dBA) indicate wind contamination — but context (ADS-B, weather, timing) is required to distinguish wind from aft-aspect aircraft noise.
Aft-aspect noiseJet exhaust mixing noise heard when an aircraft is receding. Naturally low-frequency dominated, producing a large C-A delta that is real aircraft noise, not a measurement artifact.

Section 6d

Psychoacoustic metrics and health evidence

A natural question about the TrueNoise measurement approach is why it captures psychoacoustic metrics — loudness in sone, sharpness in acum, annoyance index — rather than relying solely on dBA. The answer is grounded in the health evidence.

The health effects of aircraft noise are primarily mediated through the annoyance response — and annoyance is determined not by loudness alone, but by the spectral and temporal character of the sound. A longitudinal study found that nearly 66% of the effect on self-reported health was mediated by annoyance, not by the noise level directly (Cousson et al., 2024). The pathway from aircraft noise to cardiovascular disease runs through the subjective experience of the sound — which psychoacoustic metrics capture and dBA alone does not.

After controlling for loudness, sharpness and tonality independently predict annoyance to aircraft noise (McKinley et al., 2023; Caillet et al., 2016). Two sounds at identical dBA levels can produce substantially different annoyance depending on their spectral character. dBA misses this distinction entirely.

The psychoacoustic metrics TrueNoise captures are therefore not merely descriptive — they are the upstream acoustic drivers of the health outcomes documented in the HYENA and RANCH epidemiological studies.

References

Fastl, H. & Zwicker, E. (2007). Psychoacoustics: Facts and Models (3rd ed.). Springer-Verlag.
Cousson, P.Y. et al. (2024). Effects of aircraft noise exposure on self-reported health through aircraft noise annoyance. Environmental Research. PMC11349086
McKinley, R. et al. (2023). Sound quality metric indicators of rotorcraft noise annoyance. JASA, 153(2), 867.
Caillet, G. et al. (2016). Aircraft noise annoyance modelling. Applied Acoustics, 111, 253–263.
Berglund, B. et al. (1995). Community Noise. WHO, Geneva.
Munzel, T. et al. (2018). Adverse effects of environmental noise on oxidative stress and cardiovascular risk. Antioxidants & Redox Signaling, 28(9), 873–908.
Dratva, J. et al. (2016). Cardiovascular and stress responses to short-term noise exposures. Environment International, 97, 224–233.

Section 7

Validation and field calibration

Reference meter field calibration

The Wintact SLM-30B reference meter (IEC 61672 Class 2) is field-calibrated to 94.0 dBA at the start of each comparison session using an AZ8930 acoustic calibrator (IEC 60942:2018 Class 2). A pre-session calibration log is maintained. The calibration chain is independently traceable, verified per session, and documented for audit.

Overflight comparison validation — Standardized Receptor

Following the 1 June 2026 calibration correction, opportunistic simultaneous comparisons against the AZ8930-verified Wintact SLM-30B have been conducted across multiple sessions. The Standardized Receptor dataset now covers three aircraft across two altitudes and three aircraft families:

Aircraft	Altitude	App (dBA)	SLM-30B (dBA)	Delta
B767 takeoff	—	76.5	76.6	−0.1 dB ✓
A321 takeoff	2,425 ft	78.2	78.6	−0.4 dB ✓
757-200 takeoff	4,650 ft	69.3	69.1	+0.2 dB ✓
SR mean (3 aircraft)				−0.1 dB · range 0.6 dB

The 0.6 dB range across three aircraft families and two altitudes is notably tight. The 757-200 at 4,650 ft is the first non-negative post-correction delta, suggesting the mean is stable near zero rather than consistently biased in one direction at Standardized Receptor. Whether the +0.2 dB reflects a spectral effect (the 757-200 has a different LF character), altitude-dependent geometry, or statistical variation cannot be determined from three samples — the dataset will continue to grow.

Expanded validation — Façade-Level geometry (8 June 2026)

A simultaneous eight-aircraft comparison was performed at a Façade-Level measurement position (0.5–2 m from a single building wall), yielding the following dataset:

Aircraft	App (dBA)	SLM-30B (dBA)	Delta
A321 takeoff	73.9	74.7	−0.8
737 Max 8 takeoff	73.0	73.9	−0.9
737 Max 8 takeoff	73.5	74.6	−1.1
A321 takeoff	74.1	74.7	−0.6
737 Max 8 takeoff	72.3	72.7	−0.4
737-900 takeoff	77.6	78.3	−0.7
737-800 takeoff	74.5	75.3	−0.8
EMB 505 (Rwy 15L)	59.4	60.3	−0.9
Façade mean (8 aircraft)			−0.78 dB · range 0.7 dB

All eight deltas are negative — the app reads consistently below the reference meter at this geometry. The probability of 8/8 same-direction results from random variation alone is approximately 0.8%, confirming a real systematic signal. The bias direction is conservative: the app under-reports relative to the reference meter. Any challenge to the dataset on grounds of inflation is contradicted by this result.

Geometry comparison — what the two datasets reveal

Geometry	N	Mean Δ	Range	Interpretation
Standardized Receptor	3	−0.1 dB	0.6 dB	Mic + offset chain, free-field — minimal boundary contribution
Façade-Level	8	−0.78 dB	0.7 dB	Boundary interference field — capsule position in reflection pattern

The geometry effect (SR vs Façade: 0.7 dB) and the within-SR altitude effect (range 0.6 dB) are comparable in magnitude. This indicates that measurement position in the local reflection pattern matters more than aircraft altitude for the app-vs-meter delta. The +96 dB offset is validated for the isolated mic + offset chain; the Façade-Level bias is an environmental boundary effect, not a calibration offset.

Post-correction validation now spans 11 aircraft across 2 measurement geometries — B767, A321 (×3), 737-700, 737-800, 737-900, 737 Max 8 (×3), 757-200, EMB 505. Overall mean Δ ≈ −0.55 dB, range −1.1 to +0.2 dB. All within IEC 61672 Class 2 ±2 dB tolerance. In any context where the dataset is challenged: across 11 post-correction measurements the app reads on average 0.55 dB below a Class 2 reference meter.

† The 9 May 2026 A320 comparison was made against the TA657A before its +2.9 dB drift was identified. The reported agreement (−1.4 dB) was anchored to a drifted reference. Corrected interpretation: app read +1.5 dB high relative to true SPL. This reading does not support any calibration claim and is retained as a historical data point only. See Section 7b for the full calibration correction record.

Section 7b

Calibration history and the 2026-06-01 offset correction

On 1 June 2026, we verified our prior reference meter — a TA657A (IEC 61672 Class 2) — against the AZ8930 calibrator with a tight coupler seal. The TA657A read 96.9 dBA against the calibrator's 94.0 dBA tone: a +2.9 dB drift, outside its IEC 61672 Class 2 ±2 dB tolerance. The drift had propagated into the application's external-microphone calibration offset. After identifying the drift, we replaced the reference meter with the calibrator-verified Wintact SLM-30B and corrected the offset from +99 dB to +96 dB.

Aircraft	App reading	Reference reading	Delta	Status / Geometry
B767 takeoff	76.5 dBA	76.6 dBA	−0.1 dB ✓	Post-correction · Standardized Receptor
A321 takeoff · 2,425 ft	78.2 dBA	78.6 dBA	−0.4 dB ✓	Post-correction · Standardized Receptor
757-200 takeoff · 4,650 ft	69.3 dBA	69.1 dBA	+0.2 dB ✓	Post-correction · Standardized Receptor
A321 takeoff	73.9 dBA	74.7 dBA	−0.8 dB ✓	Post-correction · Façade-Level
737 Max 8 takeoff	73.0 dBA	73.9 dBA	−0.9 dB ✓	Post-correction · Façade-Level
737 Max 8 takeoff	73.5 dBA	74.6 dBA	−1.1 dB ✓	Post-correction · Façade-Level
A321 takeoff	74.1 dBA	74.7 dBA	−0.6 dB ✓	Post-correction · Façade-Level
737 Max 8 takeoff	72.3 dBA	72.7 dBA	−0.4 dB ✓	Post-correction · Façade-Level
737-900 takeoff	77.6 dBA	78.3 dBA	−0.7 dB ✓	Post-correction · Façade-Level
737-800 takeoff	74.5 dBA	75.3 dBA	−0.8 dB ✓	Post-correction · Façade-Level
EMB 505 takeoff (Rwy 15L)	59.4 dBA	60.3 dBA	−0.9 dB ✓	Post-correction · Façade-Level
A321 takeoff	79.4 dBA	76.3 dBA	+3.1 dB	Pre-correction — confirmed offset magnitude
737-700 takeoff	78.4 dBA	75.6 dBA	+2.8 dB	Pre-correction — confirmed offset magnitude

This is a strength, not a weakness. Identifying and correcting a 2.9 dB drift by verifying against a primary acoustic reference is exactly the chain-of-custody discipline regulatory-grade measurement requires. Most regulatory-grade noise datasets never verify their reference meters against a primary calibrator. The corrected dataset is now traceable to an IEC 60942:2018 Class 2 acoustic calibrator.

Chronological calibration record:

9 May 2026A320 approach comparison against TA657A (internal mic, home patio). Reported delta −1.4 dB. Corrected interpretation: app +1.5 dB high relative to true SPL. Reference meter subsequently found drifted +2.9 dB. Retained as historical data point only — does not support any calibration claim.
May 2026External microphone calibration offset anchored to TA657A via indoor bare-microphone match test. Offset set to +99 dB. Drift inherited unknowingly into all May 2026 measurements.
1 June 2026TA657A verified against AZ8930 with tight coupler seal. Reading: 96.9 dBA against 94.0 dBA reference — +2.9 dB drift confirmed, outside Class 2 ±2 dB tolerance. TA657A retired as reference instrument.
1 June 2026Wintact SLM-30B verified against AZ8930. Reading: 94.0 dBA — confirmed clean. SLM-30B adopted as primary reference meter.
1 June 2026Application external-mic offset corrected from +99 dB to +96 dB. Three-aircraft validation performed at standardized receptor. B767 post-correction delta: −0.1 dB. ✓
2 June 2026External microphone placed directly in AZ8930 calibrator with snug coupler fit; windshield setting changed to bare/None in app. App reading: 93.7 dBA against 94.0 dBA reference. Delta: −0.3 dB. Independent confirmation of the +96 dB offset at 1 kHz pure tone — validates the microphone + offset sub-chain in isolation from the windshield and aircraft signal characteristics.
8 June 2026Eight-aircraft simultaneous comparison at Façade-Level position against AZ8930-verified Wintact SLM-30B. Aircraft: A321 (×2), 737 Max 8 (×3), 737-900, 737-800, EMB 505. All 8 deltas negative (app below meter). Mean delta: −0.78 dB. Range: −0.4 to −1.1 dB. App reads conservatively at façade geometry — consistent with position-dependent reflection interference between non-co-located capsules. All within IEC 61672 Class 2 ±2 dB tolerance. ✓
8 June 2026Standardized Receptor dataset expanded to 3 aircraft: A321 at 2,425 ft (−0.4 dB) and 757-200 at 4,650 ft (+0.2 dB) added to B767 (−0.1 dB). SR mean Δ: −0.1 dB · range 0.6 dB. Geometry effect (SR vs Façade: 0.7 dB) comparable to altitude effect within SR (0.6 dB range) — position in local reflection pattern is the dominant variable. Post-correction validation dataset now spans 11 aircraft across 2 geometries. ✓

Affected data range: Measurements taken before 1 June 2026 carry a known +3 dB systematic bias in absolute SPL terms. Derived psychoacoustic metrics inherit a proportional bias. See Data Treatment for how pre-correction data is handled in analytics.

Pre-correction data is not invalid. Relative patterns, spectral character, aft-aspect geometry, atmospheric HF absorption signatures, and event-to-event comparisons within a single session are preserved. What changes is the absolute calibration of peak SPL claims and health threshold exceedance counts, which now use post-correction data only.

Section 8

Limitations

Honest disclosure is part of credibility. The following are known limitations of the current system:

1

The microphone is not IEC 61672 certified. It is a consumer omnidirectional condenser with a manufacturer-published frequency range but no individual calibration certificate. We rely on empirical validation against the AZ8930-verified Wintact SLM-30B. Post-correction comparisons fall within the ±2 dB IEC 61672 Class 2 unit-to-unit tolerance (B767 −0.1 dB). Pre-correction spectral characterization against the TA657A is under re-characterization. Windshield insertion loss has been confirmed by relative measurement, unaffected by the absolute calibration correction. The system's empirical confidence rests on the AZ8930 → SLM-30B → app chain of custody rather than manufacturer specification alone.

2

Post-correction validation dataset is still small. Current validation rests primarily on a single post-correction overflight (Boeing 767, −0.1 dB agreement) plus the multi-aircraft pre-correction dataset that quantified the calibration drift (A321 +3.1 dB, 737-700 +2.8 dB — both outside Class 2 tolerance, confirming the offset magnitude). The methodology is sound regardless of sample size, but the empirical confidence interval narrows as post-correction comparisons accumulate. We will publish all future comparisons, including ones that disagree.

3

Measurements are displayed as 2-second peak-hold dBA. The reported value is the maximum A-weighted SPL observed during each 2-second window — closest in character to LAFmax in IEC 61672 terms. We do not currently report continuous LAeq integration over arbitrary periods.

4

We measure at the receptor, not at a standardized distance from the source. This is deliberate — receptor-based exposure is the variable health research uses — but it means our absolute SPL values cannot be directly compared to source-emission certifications like ICAO Annex 16 Chapter 14 noise levels.

5

Wind affects readings. The fur windshield reduces wind contamination substantially, and the C-A Delta provides objective per-observation flagging. However, high-wind sessions (sustained winds above approximately 15 mph) may produce residual contamination that the windshield cannot fully suppress. All sessions note wind conditions; high-wind sessions are flagged in the dataset.

6

The internal microphone offset (+108 dB) has not yet been re-validated against the AZ8930-verified reference chain. The internal microphone is not used for primary field measurement — the external microphone is the reference for all field sessions. The internal mic is used only for indoor characterization work. Re-validation against the SLM-30B is planned.

Section 9

Summary and data availability

TrueNoise measures community aircraft noise using a calibrated smartphone-based system anchored to a primary acoustic reference (IEC 60942:2018 Class 2 calibrator), validated against a Class 2 reference meter that is itself field-verified against that calibrator before each comparison session, with the calibration validated empirically on multiple aircraft overflights including a post-correction agreement of −0.1 dB on a Boeing 767.

The system is not a substitute for a laboratory-grade measurement microphone. It is a measurement system whose empirical performance has been characterized against traceable references, whose limitations are documented, and whose data is published openly for independent evaluation.

Data availability

All session data is available for download at truenoise.org/data.html. The dataset includes raw observations, psychoacoustic metrics, ADS-B attribution, calibration epoch tags, and windshield configuration for every recorded event. Pre-correction data is tagged pre_2026_06_01; post-correction data is tagged post_2026_06_01.