Population-Aligned Audio Reproduction With LLM-Based Equalizers

January 14, 2026

5 authors

arXiv:2601.09448v1

Authors

Ioannis Stylianou, Jon Francombe, Pablo Martinez-Nuevo, Sven Ewan Shepstone, Zheng-Hua Tan

Abstract

Conventional audio equalization is a static process that requires manual and cumbersome adjustments to adapt to changing listening contexts (e.g., mood, location, or social setting). In this paper, we introduce a Large Language Model (LLM)-based alternative that maps natural language text prompts to equalization settings. This enables a conversational approach to sound system control. By utilizing data collected from a controlled listening experiment, our models exploit in-context learning and parameter-efficient fine-tuning techniques to reliably align with population-preferred equalization settings. Our evaluation methods, which leverage distributional metrics that capture users' varied preferences, show statistically significant improvements in distributional alignment over random sampling and static preset baselines. These results indicate that LLMs could function as "artificial equalizers," contributing to the development of more accessible, context-aware, and expert-level audio tuning methods.

Paper Information

arXiv ID: 2601.09448v1
Published: January 14, 2026
Categories: cs.SD, cs.AI
