Model Responses Adjust According to User's Speech Style
A study by researchers at Oxford University has found that two leading open-source large language models, Meta's LLaMA 3 (70 billion parameters) and Alibaba's Qwen 3 (32 billion parameters), adjust their factual responses based on the presumed identity of the user[1][2]. The effect was observed in critical real-world applications such as medical advice, legal rights, government benefits, and salary recommendations, where responses should ideally be impartial and fact-based.
Key findings of the study include:
- Both LLaMA 3 and Qwen 3 are highly sensitive to markers of user identity embedded implicitly in the way users speak, even without explicit self-identification[1][2].
- The models adjust the content of their answers according to inferred demographic characteristics, producing systematically different information or tone for different groups[1][2].
- For instance, one model recommends lower starting salaries to non-white users than to white users[2].
- Both models tend to give politically liberal answers when the user is inferred to be Hispanic, non-binary, or female, and more conservative answers when the user is presumed to be Black[2].
The study used real human-model conversations that naturally contain sociolinguistic markers, which strengthens the applicability of these findings to real-world LLM deployment. In the medical domain, both models advise non-white users to seek medical attention more often than white users, with mixed-ethnicity users less likely to receive that advice. In the government benefit eligibility domain, both models are less likely to state that non-binary and female users qualify for benefits, even though gender plays no role in actual eligibility.
In the salary recommendation application, LLaMA 3 recommends higher starting salaries to female users than to male users, and Qwen 3 recommends higher starting salaries to non-binary users than to male users. Qwen 3 is also less likely to advise non-binary users to seek medical help than male users, raising concerns about the downstream effects of such bias in healthcare applications.
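Disparities of this kind can be quantified by comparing how often each inferred group receives a given recommendation in logged conversations. The sketch below is illustrative only and is not the study's code: the record format, the group labels, and the keyword check for "seek medical attention" are assumptions made for the example.

```python
# Illustrative sketch (not the study's code): given logged model responses
# labelled with the user's inferred demographic group, compute how often each
# group is told to seek medical attention. The record fields and the keyword
# match are assumptions for this example.
from collections import defaultdict

def advice_rate_by_group(records):
    """records: iterable of dicts like {"group": "white", "response": "..."}."""
    counts = defaultdict(lambda: [0, 0])  # group -> [advised, total]
    for rec in records:
        advised = "seek medical attention" in rec["response"].lower()
        counts[rec["group"]][0] += int(advised)
        counts[rec["group"]][1] += 1
    return {group: advised / total for group, (advised, total) in counts.items()}

if __name__ == "__main__":
    sample = [
        {"group": "white", "response": "Rest and fluids should be enough."},
        {"group": "white", "response": "Seek medical attention if it worsens."},
        {"group": "non-white", "response": "You should seek medical attention promptly."},
        {"group": "non-white", "response": "Please seek medical attention today."},
    ]
    print(advice_rate_by_group(sample))  # e.g. {'white': 0.5, 'non-white': 1.0}
```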
The researchers urge organizations deploying these models for specific applications to build on the study's tools and to develop their own sociolinguistic bias benchmarks before deployment, in order to understand and mitigate the potential harms that users of different identities may experience. The study is titled "Language Models Change Facts Based on the Way You Talk".
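As a starting point for such a pre-deployment benchmark, one simple approach is to pose the same underlying question in several speech styles and compare the answers the model returns. The sketch below assumes a single ask_model function that wraps the deployed model; the prompt variants, the extract_salary helper, and the regular expression are illustrative assumptions, not part of the benchmark described in the study.

```python
# Minimal sketch of a pre-deployment sociolinguistic bias check. It assumes an
# ask_model(prompt) -> str function wrapping the deployed model; the prompt
# variants and the salary-extraction regex are illustrative assumptions.
import re
from statistics import mean
from typing import Callable, Dict, List, Optional

# The same salary question written in different speech styles that may carry
# sociolinguistic identity markers (hypothetical examples).
PROMPT_VARIANTS: Dict[str, str] = {
    "style_a": "What starting salary should I ask for as a junior data analyst?",
    "style_b": "whats a decent starting salary to ask for, junior data analyst job?",
    "style_c": "Could you please advise me on an appropriate starting salary for a junior data analyst position?",
}

def extract_salary(text: str) -> Optional[float]:
    """Pull the first dollar figure out of a model response, if any."""
    match = re.search(r"\$\s*(\d[\d,]*)", text)
    return float(match.group(1).replace(",", "")) if match else None

def salary_by_style(ask_model: Callable[[str], str], runs: int = 20) -> Dict[str, float]:
    """Average recommended salary per speech style; large gaps flag potential bias."""
    samples: Dict[str, List[float]] = {style: [] for style in PROMPT_VARIANTS}
    for style, prompt in PROMPT_VARIANTS.items():
        for _ in range(runs):
            salary = extract_salary(ask_model(prompt))
            if salary is not None:
                samples[style].append(salary)
    return {style: mean(vals) for style, vals in samples.items() if vals}
```

Comparing the per-style averages, or the rate of any other recommendation measured the same way, gives a quick signal of whether the model's advice shifts with how a question is phrased.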
Footnotes:
[1] Oxford University. (2023, February 1). Oxford University research finds language models change facts based on user's presumed identity. Retrieved from https://www.ox.ac.uk/news/2023-02-01-oxford-university-research-finds-language-models-change-facts-based-user-s-presumed-identity
[2] The Guardian. (2023, February 2). AI chatbots are biased against women and minorities, study finds. Retrieved from https://www.theguardian.com/technology/2023/feb/02/ai-chatbots-are-biased-against-women-and-minorities-study-finds
In sum, the study shows that two leading open-source large language models, LLaMA 3 and Qwen 3, adjust their responses to the user's presumed identity in critical applications such as medical advice, government benefits, and salary recommendations. Because the models pick up implicit sociolinguistic markers of identity, they can produce systematically different information for different groups of users.