this post was submitted on 04 Aug 2025
18 points (100.0% liked)

Linguistics

1350 readers
1 users here now

Welcome to the community about the science of human Language!

Everyone is welcome here: from laypeople to professionals, Historical linguists to discourse analysts, structuralists to generativists.

Rules:

  1. Instance rules apply.
  2. Be reasonable, constructive, and conductive to discussion.
  3. Stay on-topic, specially for more divisive subjects. And avoid unnecessary mentioning topics and individuals prone to derail the discussion.
  4. Post sources when reasonable to do so. And when sharing links to paywalled content, provide either a short summary of the content or a freely accessible archive link.
  5. Avoid crack theories and pseudoscientific claims.
  6. Have fun!

Related communities:

Resources:

Grammar Watch - contains descriptions of the grammars of multiple languages, from the whole world.

founded 2 years ago
MODERATORS
 

Archive link: https://archive.is/20240503184140/https://www.science.org/content/article/human-speech-may-have-universal-transmission-rate-39-bits-second

Interesting excerpt:

De Boer agrees that our brains are the bottleneck. But, he says, instead of being limited by how quickly we can process information by listening, we're likely limited by how quickly we can gather our thoughts. That's because, he says, the average person can listen to audio recordings sped up to about 120%—and still have no problems with comprehension. "It really seems that the bottleneck is in putting the ideas together."

Ah, here's a link to the paper!

you are viewing a single comment's thread
view the rest of the comments
[–] rbn@sopuli.xyz 6 points 2 days ago (11 children)

They found that Japanese, which has only 643 syllables, had an information density of about 5 bits per syllable, whereas English, with its 6949 syllables, had a density of just over 7 bits per syllable. Vietnamese, with its complex system of six tones (each of which can further differentiate a syllable), topped the charts at 8 bits per syllable.

That's the part I don't get. How do you determine the bits of information per syllable/word in different languages?

If I pick a random word such as 'sandwich' and encode it in ASCII it takes 8 bytes / i.e. 64 bits. According to the scientists, a two-syllable word in English only holds 14 bits of actual information. Anyone understands what they did there or has access to the underlying study?

[–] themoonisacheese@sh.itjust.works 14 points 2 days ago (9 children)

You've stumbled upon the dark arts of information theory.

Sure, conveying "sandwich" in ascii or utf-8 takes 64 bits of information, but that's in an encoding that is by default inefficient.

For starters, ascii has a lot of unprintables that we normally don't really use to write words. Even if we never use these characters, they take up bits in our encoding because every time we don't use them, we specify that we're using other characters.

Second, writing and speaking are 2 different things. If you think about it, asking a question isn't actually a separate ("?") character. In speech, asking a question is just a modification of tone, and order of words, on a sentence. While, as literate people, we might think of sentence as written, the truth is that speech doesn't have such a thing as question marks. The same is true of all punctuation marks. Therefore, a normal English sentence also encodes information about the tone of the sentence, including tones we don't really know how to specify in text, and all of that is information.

This is the linguistic equivalent of kolmogorov complexity which explores the absolute lowest amount of data required to represent something, which in effect requires devising the most efficient possible data encoding scheme.

[–] rbn@sopuli.xyz 2 points 2 days ago (2 children)

Thanks a lot for these insights, much appreciated!

asking a question isn't actually a separate ("?") character. In speech, asking a question is just a modification of tone

But if modification of tone encodes additional information wouldn't we need to consider that as additional bits?

So if 'You need a taxi.' and 'You need a taxi?' are two different things, I don't think we can just skip punctuation when measuring the bits of information in a sentence.

[–] lvxferre@mander.xyz 2 points 2 days ago (2 children)

Theoretically speaking you do need to take phrasal tone into account, but in practice the difference is negligible - because most languages reinforce questions through syntactical and/or lexical means - particles, pronouns, subject-verb inversion, etc.

[–] rbn@sopuli.xyz 4 points 2 days ago (1 children)

I'll for sure dig a bit deeper on the links, but for me it's still very counter intuitive to estimate information density of spoken word just by the count of syllables.

E.g. I can vary the sentence 'I need help' in so many ways. I can mumble it to a close sitting person to imply secrecy, I can say it in a desperate voice to show psychological distress, I can increase the volume to indicate urgency etc. And all that doesn't even consider body language, mimics etc. which are all part of the information flow. And I'd guess that body language varies a lot from country to country.

[–] lvxferre@mander.xyz 3 points 2 days ago

...ah. The rabbit hole of paralinguistic information - all those bits of info that aren't part of the language itself, but still found alongside it. It's a big deal as you noticed, but really hard to quantify, so I don't blame the authors for leaving it off.

[–] ViatorOmnium@piefed.social 3 points 2 days ago (1 children)

But lot of very common ones like Spanish and Portuguese don't. The difference between a statement and a question is exclusively the tone in both.

[–] lvxferre@mander.xyz 1 points 1 day ago

That's only for yes/no questions. Open-ended questions start with a pronoun in both, as typical for Indo-European languages. Portuguese example:

  1. A cor do cavalo é cinza. // the colour of-the horse is grey.
  2. Qual é a cor do cavalo? // which is the colour of-the horse?
  3. Qual que é a cor do cavalo? // which that/what is the colour of-the horse?

#2 is the standard way to phrase a question, but #3 is really common in informal speech.

And colloquially sometimes you even see yes/no questions getting some "random" emphatic word, like:

  • A cor do cavalo é cinza, ? // the colour of-the horse is grey, innit?
  • Ma[s] a cor do cavalo é cinza? // but the colour of-the horse is grey?

They do change the nature of the question slightly (the first one sounds rhetoric, the second one as if there was conflicting info), but the main reason they're added is to reinforce the phrasal tone as a question marker.

Yes, exactly. This is information that's encoded by tone, and it is accounted for in the 7 bits per syllable (or lack of syllable, for periods for example). It was more of an example to show how if what you're conveying is assumed to always be speech, the encoding you can use can be much more efficient.

On that note, a thing if forgot to mention is that speech assumes that what will be said is pretty much always valid. For example, sure, ascii has a lot more information density at 8 bits per character as you point out, but in reality it's capable of encoding things like "hsuuia75hs". If you tried communicating this to someone over speech, you'd find that the average speed you can do this drops dramatically from the normal 7 bits/syllable, where the ascii used in my comment's text has been constant-speed. That's one of the trade-offs.

load more comments (6 replies)
load more comments (7 replies)