Try searching by:
1. Category (Best $100 Headphones, Best 50 Inch TVs, …)
2. Brand (Sony, Samsung, LG, Bose, …)
3. Model (S405, QC 35 II, …)
4. Etc...

Get insider access

Test Bench 2.0

Changelog

Updated May 01, 2025 at 11:11 am

By Christopher Steward

Headphones 2.0 introduces an array of changes to both usages and tests to improve the usefulness of our test suite. This changelog looks different from our usual changelogs as it features commentary from our test designer, Pierre, explaining some of the rationale behind our decisions to help give you greater insight into the changes. We've also published an R&D article, specifically explaining our Cumulative Spectral Decay (CSD) measurements in greater detail.

What's Changed?

Test Group	Description
Sound Profile/Raw FR	New multi-curve tool with different response curves Sound signatures for quick classification/sorting within the table tool
Frequency Response Consistency	B&K Type 5128 measurements, using 3-5 passes + human passes for over-ears All passes are now visible on the graphing tool
Stereo Mismatch	Groups together amplitude, frequency, and phase mismatch tests Phase mismatch given in degrees
Group Delay	Now has its own test box. Results given in milliseconds
Cumulative Spectral Decay	CSD data presented as waterfall plot
PRTF	Replaces soundstage and removes acoustic space excitation and openness
Harmonic Distortion	Data displayed as an unweighted graph, weighted averages at 94 and 104 dB SPL Additional graph showing individual harmonics at both amplitudes
Electrical Aspects	Groups together impedance, sensitivity, and Bluetooth max SPL with an impedance graph
Breathability	Removes Breathability results
Usages/Performance Usages	Adds Audio Reproduction Accuracy, Noise Isolation, Frequency Response Consistency, and Microphone as performance usages Minor changes to Sports & Fitness, Office Work, Travel, Wired/Wireless Gaming usages
Ribbon	Adds Bass Amount, Treble Amount, and Sound Signature

You'll notice that we've rearranged the order in which our tests appear within each review. We wanted to ensure that our revamped sound tests had better visibility, so we moved these to the top of the review.

New Features

Multi-Target Graph Tool

We've added an interactive graph tool that can compensate a response to a range of targets. You'll notice that if you click on the Sound Profile, Raw FR, or Bass, Mid, and Treble compliance graphs to enlarge them, you can now view comparisons with other target curves validated on the B&K Type 5128 HATS, such as the Harman IEM target, the SoundGuys.com headphone target response curve, the SoundGuys.com studio curve, and a diffuse field response, among others.

Our new graphing tool shows the Sennheiser HD600's raw frequency response. — The Sennheiser HD 600's raw frequency response in our new graphing tool.

Test Designer Pierre's Notes: First, regarding the Multi-Target Interactive Graph tool, we want to credit our fellow headphone researchers and technical reviewers who contributed to this. It's a real collaborative effort. Knowledge is meant to be shared, and everybody wins if we all work together. We couldn't be more thankful. Here's a breakdown of those targets:

1. RTINGS.com Target Curve: Our in-house curve is designed to reflect what we believe offers the most balanced listening experience for a general audience. You can learn more about it in our previous headphone R&D article.
2. Harman IEM Target Curve: Developed based on extensive research into listener preferences, the Harman target curve is the most widely accepted reference for IEM measurements.
3. SoundGuys Curves: We feature both of the SoundGuys target curves: their Studio Curve and the Headphones Preference Curve. Their approach is to derive an ideal response based on a number of earbuds and headphones that have reference status and a high preference rating.
4. SenseLab/Aizu Target Curve: This curve was developed through research involving both Danish and Japanese listeners. Thirty-two headphones' sound signatures and documented targets were evaluated. The final target curve was derived as the average of the five highest-rated curves by 56 participants.
5. B&K 5128 Diffused Field Response: This one shouldn't be considered a target. It's the head-related transfer function of the Type 5128 head simulator when placed in a flat diffuse field. Although the ITU-R BS.708 standard proposed in 1990 stated that studio monitor headphones should be voiced according to a diffuse field HRTF, it has been since essentially disqualified, as most people wouldn't find this a desirable frequency response.

We chose these targets as they're researched and validated preference targets. While we recognize some other targets, most are essentially reviewer-specific preferences on a Tilt Value of the Diffuse Field response of the 5128, which has value, too. Dr. Sean Olive from Harman also conducted some studies on various target curves for the 5128 and presented a seminar on his findings at the latest edition of CanJam NYC. The conference was titled: 'New Headphone Target Curves Defined on the B&K 5128: How Different Do They Sound and Which One is Most Preferred?'

Harman happened to share our view in terms of validated target curves. In Harman's terms, validated means validated in controlled listening tests.

A screenshot of Harman's list of proposed targets for the B&K 5128. — Harman's list of proposed targets for the B&K 5128.

Test Designer Pierre's Notes: We've also followed the target used by Headphones.com in their reviews with great interest. Like us, they've been forerunners of measurements on the 5128 platform and have performed significant research. The way they present "preference bounds" calibrated to a diffuse field 5128 HRTF in the case of headphones and their JM-1 baseline for IEMs makes them incompatible with our graph tool. However, it certainly aligns with our approach that a balanced sound profile isn't just one single exact curve. Here's an overlay of our tool's curve selection and the Headphones.com preference target for headphones.

A graph featuring the Headphones.com preference bounds, along with all the curves we feature in our multi-curve tool. — An overlay of the headphones.com 'preference bounds' with all the curves available in our multi-curve tool.

Test Designer Pierre's Notes: As you can see, there's some convergence between technical reviewers' view of a balanced sound profile. The last Harman listening test and AES study on target curves for the B&K5128 also had more than one statistically equal preference target. These curves aren't all the same; they're audibly different, you can prefer one over the other, and we give you that choice. However, at this stage in headphone research, there isn't one that can be deemed universally preferred.

New Tests

Sound Profile

The Bose QuietComfort 45/QC 45 Wireless' Sound Signature, Bass, and Treble Amount.

The Sennheiser HD 560S' Sound Signature, Bass, and Treble Amount.

While the sound profile tests are mostly unchanged, we've added Sound Signatures to make it easier to sort headphones according to your preferences.

Sound Signature	Characteristics
Boosted Bass	Bass-forward sound with a de-emphasized treble range
V-Shaped	Excited sound, with prominent bass and treble
Warm	Emphasized bass and low-mids with rolled-off highs for a smoother sound
Flat	Follows a diffuse field sound target in the bass and treble
Balanced	Follows our target curve in the bass and treble
Bright	Emphasized and sharp treble, with a more recessed bass response
Elevated Mid-Range	Under-emphasized treble and bass response, prominent mids

Test Designer Pierre's Notes: Sound signature comparisons were designed to help you quickly assess whether a pair of headphones' sound profile will agree with your preferences. Those signatures are considered the general families of tonal balance. There's no good or bad here; rather, we're looking at the general bias of the sound profile. We're not trying to describe all the subtleties of the sonic palette. Think wide band, not band-specific colorations.

Some of you may wonder about the difference between 'Flat' and 'Balanced.' We consider a Balanced sound signature to be when headphones' frequency response generally follows researched preference headphone targets. Flat generally refers to a non-elevated and extended bass region and a balanced treble response.

Frequency Response Consistency

We've expanded the scope and accuracy of our Frequency Response Consistency test. We now use the B&K Type 5128 testing head for our baseline measurements. For IEMs, we collect the frequency response after three re-seats. For on-ears and over-ears, we perform five passes on the testing head, but the bass and mid regions are also measured with a canal-blocking in-ear microphone on humans with various physical features: long-haired individuals, glasses wearers, and small, medium, and large heads. We gray out our measurements above 2k due to the increased variance and difference in human hearing perception in this range.

A graph showing the frequency response consistency of the Sony WH-1000XM5 when looking only at passes where people had long hair. — The Sony WH-1000XM5 Wireless' frequency response consistency.

Enlarging the graph allows you to see an averaged response for each human subject, as well as the individual passes. This can provide valuable insight for users on a more personalized basis. For example, users with glasses can see how consistently their headphones can deliver audio between multiple re-seats, so you can better understand the deviations from the overall average of all measurements.

Test Designer Pierre's notes: We wanted to portray a key element in headphones' frequency response measurements with this updated test. We already know that the frequency response of headphones measured on different artificial heads will look slightly different. However, it goes further: the nice, lightly-smoothed frequency response curve that you use to evaluate headphones' tonal characteristics is, in fact, an average and, to some extent, an approximation.

For earbuds, the insertion depth will cause discrepancies, mainly in the treble, and the coupling of the IEM tip with the silicone artificial ear will also cause discrepancies. For over-ears and on-ears, the seal is the main source of deviations, but any small differences in placement will interact with the artificial pinnae differently.

What you hear is slightly different than what someone else might hear and is different from what is captured by the ear simulator of our test rig. We felt that the old approach of merging the average responses of the human passes in the bass and using only the HATS for the mid and treble regions wasn't giving you the full picture. For over-ears and on-ears, you get a total of 20 measurements and five averages for both stereo channels. That's a whopping 50 curves to look at. Unfortunately, we can't capture frequency response on human subjects for IEMs and earbuds since it requires in-ear microphones.

You will also note that we only publish the human-measured response below 2000Hz. It's not that the results above that range aren't valid for a canal-blocking microphone, but they can't be coherently compared on the same graph as our HATS measurements that don't have the blocked canal.

Looking at this variability reinforces our belief that finding a single exact "perfect" target curve is a fruitless endeavor to some extent, as the differences we get from the same headphones are in the same order of magnitude as the difference in those published and researched targets.

Stereo Mismatch

While this test isn't entirely new, it represents a new formulation of the tests previously displayed in the Imaging box. We wanted to group all mismatches between L/R drivers within one test box, and this now includes a separate graph for frequency mismatch. We also measure phase shift in both directions and represent this on the graph tool.

Group Delay

Group delay has now been given its own test box, too, with a measurement for the weighted group delay value given in milliseconds.

Cumulative Spectral Decay

We're introducing an unscored CSD graph that plots frequency response against both amplitude and time. We derive this waterfall plot from the impulse response function in Audio Precision, using a time window that allows us to test Bluetooth headphones, as well as traditional analog headphones, using the same methodology.

A graph showing the HiFiMan Sundara 2020's CSD. — The HiFiMan Sundara 2020's CSD graph.

Test Designer Pierre's notes: Cumulative Spectral Decay for headphones is a polarizing subject, and its relevance can rightfully be questioned. Not only can the way it's derived vary wildly from one published source to the next, but it can also be argued that the sole addition of a time axis on the acoustic space that represents the coupling of headphones to a human head is peculiar to a certain degree. There's no real relationship between the time decay in milliseconds and the actual acoustical sustain time. It's purely a function of the observation window. In our case, 512 samples at 44.1 kHz.

Many of you requested this addition. We'll share our results for now, but we'd love your feedback! We're publishing them to get the pulse of our community on these three-dimensional representations of the impulse response for headphones. Are CSD graphs useful for you? Are they misleading? Do you believe that sustained resonances viewed on a CSD graph can be audible in everyday content? Read more about our CSD measurements in our R&D Article and let us know in the comments.

PRTF

Like Stereo Mismatch, this test isn't entirely new but was re-formulated as we felt it didn't represent a complete assessment of what constitutes soundstage. While our method of comparing the pinna-related transfer function of headphones with that of an angled reference speaker allowed us to draw some correlations with headphones' spatial qualities, we're aware of its limitations and shortcomings in this regard. That said, we've decided to keep the test in the form of PRTF while removing Openness and Acoustic Space Excitation, as we still feel that it can offer partial insight into the spaciousness of the headphones' soundstage.

Harmonic Distortion

We've revised our testing methodologies and the way we showcase our data. Using the B&K 5128 connected to our Audio Precision AP517b Analyzer, we now take distortion measurements at both 94dBSPL and 104dBSPL to match current industry standards.

We display our data as an unweighted THD graph, as well as in the form of weighted values at both 94 and 104dBSPL. For our weighted values, higher harmonics are given more weight using the weighting coefficient -n²/4. These harmonics are then A-weighted against frequency to account for the relative loudness perceived by the human ear before an average is found.

Finally, for those who'd prefer to see an extra step between the unweighted THD and the weighted values, we also provide a graph illustrating the second and third harmonics at 94BSPL and 104dBSPL. We include the A-weighting curve to further demonstrate how headphones' distortion performance translates to the weighted averages.

A graph showing the HiFiMan Edition XS's harmonic distortion measurements. — The HiFiMan Edition XS' harmonic distortion measurements.

Electrical Aspects

This test groups together passive impedance, sensitivity, and Bluetooth max SPL. There's also an impedance graph (for analog headphones). This information can help you assess your amplification needs.

A graph showing the impedance of the Beyerdynamic DT 770 PRO. — The Electrical Aspects section provides detail and insight into the Beyerdynamic DT 770 PRO's impedance and sensitivity.

A graph showing the impedance of the Audeze MM-100. — The Electrical Aspects section provides detail and insight into the Beyerdynamic DT 770 PRO's impedance and sensitivity.

Breathability

We've removed Breathability results from our reviews. A degree of variation between the left and right ears of our breathability rig produced results that we felt weren't representative of the headphones' actual performance.

Verdicts and Usages

We've removed and/or updated all of our verdicts. You'll likely immediately notice that Neutral Sound has also been removed. While some aspects of it live on within the Audio Reproduction Accuracy performance usage, we decided we wanted to place greater emphasis on user preference, given the difficulty in defining neutral sound and the inability of multiple studies to arrive upon a single target curve that represents the preferences of the average listener. There have also been changes to the composition of the Travel, Sports And Fitness, Office Work, and Wired/Wireless Gaming (In Development) usages, with Noise Isolation and Microphone (In Development) added as performance usages alongside Audio Reproduction Accuracy and Frequency Response Consistency. The usages marked as 'In Development' are largely unchanged from 1.8 in their composition, as we're looking to reformulate them soon.

Audio Reproduction Accuracy: This usage replaces Neutral Sound and indicates fidelity in audio reproduction.

The top headphones we've tested so far, ranked according to Audio Reproduction Accuracy.

The score components of Audio Reproduction Accuracy. — The top headphones we've tested so far, ranked according to Audio Reproduction Accuracy.

Test Designer Pierre's Notes: Regarding the removal of the neutral sound score in favor of Audio Reproduction Accuracy, as mentioned above, we're now focusing more strongly on objective metrics in headphones' audio reproduction. You can have your own preferred bias on the tonality of headphones, and that's totally valid. However, there's no debating that headphones with significant resonances or valleys in the response, or with audible discrepancies between both sides of the stereo image, perform worse and will degrade the overall listening experience more than headphones with a smooth response and a perfect balance between left and right.

Peaks And Dips and Stereo Mismatch take the lion's share of our Audio Reproduction Accuracy score. Since the statistics and research are validated, we still chose to include our compliance to target evaluations. The body of research can demonstrate that a balanced sound signature is, of course, more likely to sound good to the average person. We still see that as a measure of performance. Distortion and Group delay are also included, but as these performance metrics are often difficult to hear, they're given a comparatively modest weight.

Noise Isolation: This usage gives immediate insight into a pair of headphones' ability to attenuate environmental noises from the listening experience. In our evaluation, we also factor in how well the headphones minimize leakage.

Microphone (In Development): This indicates how well the microphone reproduces your voice in any environment. We're currently in the process of improving our evaluation of integrated microphones, but we recognize that microphone performance is an important aspect of various headphones' usages.

Test Designer Pierre's Notes: Revamping microphone testing is at the very top of our pile of to-dos. Stay tuned! A revamped suite of tests for microphone evaluation is coming.

Frequency Response Consistency: We wanted to further highlight the importance of Frequency Response Consistency as much as possible and to demonstrate the impact it can have on our frequency response measurements. This usage gives an overview of how factors like your physical features or the headphones' design can affect the frequency response.

There have also been smaller changes to other usages. We've tweaked the weighting of some of the tests that make up our Sports And Fitness, Travel, Office Work, Wired Gaming, and Wireless Gaming usages, as well as some of the verbiage. As with the Microphone performance usage, we've flagged the two gaming usages as 'In Development'; we're still working on a formulation that encompasses everything users will want from a wired or wireless gaming headset. If you have feedback on how to shape these, comment below!

Changes To The Ribbon

We've reshaped the ribbon found at the top of the review so you can quickly identify important specifications and technical details that can assist you in making a buying decision. You can also filter according to these specifications with the table tool. We added the following information to the ribbon:

Bass and Treble Amount
Sound Signature

The revised ribbon section now includes a brief overview of the headphones' key characteristics.

Let Us Know What You Think!

There's a lot of information to take in here. If there's something you want us to consider, investigate, or add to our test benches, let us know in the comments or ask us a question at our Discord AMA on May 1st.

41 Headphones Updated So Far

We are retesting popular models first. So far, the test results for the following models have been converted to the new testing methodology. However, the text might be inconsistent with the new results.

74 Headphones Planned To Be Updated

We are also planning to retest the following products over the course of the next few weeks:

Comments

Article

Active 7 hours ago

· • Posted 5 months ago

Test Bench: Improvement to sound tests: Main Discussion

What do you think of our article? Let us know below.

Want to learn more? Check out our complete list of articles and tests on the R&D page.

PreviewBack to editorFormat guide

Sort by:

newest first

Usage	Test Bench 1.8	Test Bench 2.0
Sports And Fitness
Travel
Office Work
Wired Gaming
Wireless Gaming

Test Bench 2.0 Changelog

What's Changed?

New Features

Multi-Target Graph Tool

New Tests

Sound Profile

Frequency Response Consistency

Stereo Mismatch

Group Delay

Cumulative Spectral Decay

PRTF

Harmonic Distortion

Electrical Aspects

Breathability

Verdicts and Usages

Changes To The Ribbon

Let Us Know What You Think!

41 Headphones Updated So Far

74 Headphones Planned To Be Updated

Comments

Test Bench: Improvement to sound tests: Main Discussion

Test Bench 2.0

Changelog