lvxferre

joined 2 years ago
MODERATOR OF
[–] lvxferre@mander.xyz 8 points 4 hours ago

Remember Torvalds flipping the bird, and telling a corporation to fuck itself? That was NVidia. And that was why he did it.

(I had NVidia GPUs through my whole life, except the last one - an AMD. I'm glad to have switched.)

Anyway, meldrik's answer should work fine.

[–] lvxferre@mander.xyz 2 points 5 hours ago (1 children)

You can't spit it out after you already swallowed it! And the ones I'm used to don't make any sort of hard shell, they simply wiggle their way into the fruit.

Not that it's a big deal - if they caused any harm, I'd be dead already.

[–] lvxferre@mander.xyz 1 points 5 hours ago

That's Hacker News in a nutshell.

[–] lvxferre@mander.xyz 1 points 5 hours ago

o rly.

These people were using ✨agentic✨ AI. They were using ✨thinking✨ models

The A"I" industry is full of those misleading buzzwords, isn't it?

"Agentic" would be theoretically "able to take decisions by itself, like an agent", but in practice the word is spammed so often in this context that it lost its meaning.

"Thinking"? Look, you can push and pull definitions as much as you want, but those models don't think. Nor do people who claim otherwise.

So are the AI-posters lying or what?
I think the AI-posters are a mix of the following, in order of least to most malevolent: Outright Malice\

Stop trying to gauge "intentions" (whatever this means). Focus on the fact that they're vomiting certainty on something that is blatantly incorrect.

Eventually every vibe coder reaches the point where the returns start heavily diminishing.

From my experience as translator, translating things is easier than proofreading. I expect the same to apply to code - writing code to be easier than reviewing and maintaining it. (What programmers in Lemmy often say reinforce this for me.)

[–] lvxferre@mander.xyz 3 points 7 hours ago

What if I'm my own why-foo?

[–] lvxferre@mander.xyz 4 points 7 hours ago

I wonder if that isn't a protection against mould reaching the seeds.

[–] lvxferre@mander.xyz 8 points 8 hours ago (3 children)

Yup! The ones in guava are bigger though, far more visible, and people always joke the worst part isn't finding a worm there, but half a worm. (Just a wee bit of protein~)

[–] lvxferre@mander.xyz 8 points 8 hours ago (5 children)

Anyone who eats guava straight from the tree probably ate far more maggots through their life than any H. neanderthalensis did. Including myself.

...damn, I miss that guava tree.

[–] lvxferre@mander.xyz 2 points 16 hours ago* (last edited 16 hours ago)

Interesting… I hadn’t considered that this might enable linguistic “shorthands”, is that the implication?

Yes, it is! Agreement on its own already allows some "shorthands", if you're able to omit the nouns; but derivation in special allows a lot of them, because it allows you to cram more info into the word at the "cost" of 1~3 phonemes.

I'll give you some examples of that, using Portuguese for my own convenience; do note however you'll see similar stuff popping up in other gendered languages.

First example:

1a. O relógio (M) caiu sobre a mesa (F), e ele (M) quebrou.
1b. O relógio (M) caiu sobre a mesa (F), e ela (F) quebrou.

Both sentences mean "the clock fell over the table, and it broke", but the "ele" (he/it) in 1a refers to the clock, and the "ela" (she/it) in 1b to the table. By changing the gender of the pronoun, you can force it to refer to one or another noun, in a rather succinct way you wouldn't be able to do in a non-gendered language like English. (I feel like "it" would refer to the clock, as the agent of the first phrase, and if you want to refer to the table breaking you'd need to repeat the noun.)

Of course, this "shorthand" only works if both nouns happen to have different genders, but it's already enough to cram a bit more info per syllable. In other cases people use the same strategies as in English.

Second example:

2. Pedro tem dois gatos: uma (F) frajola (F or M) e um (M) malhado (M).

Translated directly, this sentence becomes "Peter has two cats: a tuxedo and a tabby". However the translation doesn't mention the tuxedo is a female, and the tabby a male. In a non-gendered language you'd need to either ditch those pieces of info or explicitly refer to them, and that takes more words.

I’d be interested in follow up studies to examine emergent linguistic patterns. Can we weigh syllabic encoding by common usage by age? If we eliminate “thouest” from the dictionary but include “skibidi” how does that skew patterns for informational density?

The impact for an individual word would be fairly minimal, I think. However, if you're systematically changing sounds or the grammar, like languages often do (cue to "want to", "going to", "trying to" → "wanna", "gonna", "tryna"), the impact will be fairly high. And likely compensated elsewhere, to keep the bits/second ratio roughly the same.

Science is so fucking cool and I’m stoked that people nerd out on shit that I’m an idiot about so I can learn about the nature of the world.

And the fun part is that everybody is an idiot for most topics, except a few individual expertises. We're basically a race of clueless apes trying to make sense of the world.

[–] lvxferre@mander.xyz 2 points 1 day ago

Relevant to note that, if you ask 1000 people how they split media genres, you'll get 1000 different answers. There's no "true" answer, and making a list that encompasses all genres is impossible.

That said I feel like this video should be a good intro for people who aren't into anime yet.

[–] lvxferre@mander.xyz 8 points 1 day ago

I think we should all burn an effigy in commitment to safe microwaves ovens.

[–] lvxferre@mander.xyz 2 points 1 day ago (2 children)

Also, just because a syllable “encodes more bits on average” does it imply faster transmission rate?

If by "faster" you're measuring:

  • the transmission per syllable - then yes
  • the transmission per second - then no

This is easier to see in the original paper than in the OP. Check page 3; the second column is the rate of transmission per second, it's roughly 35~45 bits/s for all of them.

Just because French encodes gender information into it’s language and syllables, isn’t knowing the gender of a shovel at best “check bits?” Used for detecting transmission errors but not intrinsically critical data?

At least in theory, redundancy required by [gender, number, case, etc.] agreement shouldn't count, as it isn't adding new information - it's only repeating info already provided. In practice it's hard to model this, so the numbers for gendered languages might be a bit overestimated.

Note however gender has a second role, besides agreement: derivation. Derivation should actually increase bits/second, since it allows you to convey succinctly some stuff.

I’m fascinated by the assertion that it’s easy to establish “bits per second” on syllables having somehow abstracted away social context.

The social context (and the context, as a whole) plays a huge role on that, as paralinguistic information. However the scope there is only the linguistic info, encoded by the language itself.

 

Archive link: https://archive.is/20240503184140/https://www.science.org/content/article/human-speech-may-have-universal-transmission-rate-39-bits-second

Interesting excerpt:

De Boer agrees that our brains are the bottleneck. But, he says, instead of being limited by how quickly we can process information by listening, we're likely limited by how quickly we can gather our thoughts. That's because, he says, the average person can listen to audio recordings sped up to about 120%—and still have no problems with comprehension. "It really seems that the bottleneck is in putting the ideas together."

Ah, here's a link to the paper!

 

I regret not posting it before Canvas 2025, but hopefully it'll be useful for people playing it in 2026. All letters are 5 pixels tall, and most 3 pixels wide (some 4, a few 5). I've also included a few Cyrillic letters and the digits.

I tried to make it even smaller, but it gets really funky.

 

Interesting video on the stone that allowed researchers to decipher Ancient Egyptian. Check comments for a few notes.

 

Additional links with press coverage: ArcheologyMag, Oxford.

For context:

The Huns were nomadic people from Central Eurasia; known for displacing a bunch of Iranian (e.g. Alans) and and Germanic (e.g. Goths, Suebians etc.) speakers, that ultimately invaded the Roman Empire. They reached the Volga around 370 CE, and one of their leaders (Attila) is specially famous. Often believed to be a Turkic people, but if the study is correct they're from a completely different language family instead.

The Xiōng-Nú are mentioned by Chinese sources as one of the "Five Barbarians" (i.e. non-Han people). They would've lived in Central Eurasia between 300 BCE and 100 CE or so, and eventually became Han tributaries.

The Paleo-Siberian language in question would be an older form of Arin, a Yeniseian language. Yup, that same family believed by some to have relatives in the Americas.

 

For further info, if anyone is interested, Stephen Bax claimed a decade ago to partially decode the manuscript; here's a video with his reasoning, as well as the paper he released. Sadly Bax passed away in 2017 (may he rest in peace), so the work was left incomplete.

 

The main idea behind this language is to become evolutionary food for other languages of my conworld. As such I'll probably never flesh it out completely, only the necessary to make its descendants feel a bit more natural.

Constructive criticism is welcome.

Context and basic info

The conworld I'm building has three classical languages, spoken 2~3 millenniums before the conworld present: Old Sirtki, Classical Tarune, and Mäkşna. And scholars in the conworld present are reconstructing their common ancestor, that they call "Proto-Sitama".

What I'm sharing here, however is none of their fancy reconstructions. It's the phonology of the language as it was spoken 7 millenniums before the conworld present. Its native name was /kʲær.mi.'zɑst/, or roughly "what we speak"; the language itself had no written version but it'll be romanised here as ⟨Cjermizást⟩.

Its native speakers were a semi-nomadic people, who lived mostly of livestock herding. They'd stay in a region with their herds, collect local fruits and vegetables, and then migrate for more suitable pasture as their animals required.

It was quite a departure from the lifestyle of their star travelling ancestors, who were born in a highly industrialised society in another planet.

Grammar tidbits

Grammar-wise, Cjermizást was heavily agglutinative, with an absolutive-ergative alignment and Suffixaufnahme. So typically you'd see few long polymorphemic words per sentence. Those morphemes don't always "stack" nicely together, so you often see phonemes being elided, mutated, or added to the word.

Consonants

Manner \ Set Hard Soft
Nasals /m n/ /mʲ ɲ/
Voiceless stop /p t k/ /pʲ tʲ kʲ/
Voiced stop /b d g/ /bʲ dʲ gʲ/
Voiceless fric. /ɸ s x/ /fʲ ʃ ç/
Voiced fric. /w z ɣ/ /vʲ ʒ j/
Liquids /l r/ /ʎ rʲ/

Cjermizást features a contrast between "soft" and "hard" consonants. "Soft" consonants are palatalised, palatal, or post-alveolar; "hard" consonants cannot have any of those features. Both sets are phonemic, and all those consonants can surface outside clusters.

Palatalised consonants spawn a really short [j], that can be distinguished from true /j/ by length.

Although /j/ and /w/ are phonetically approximants, the language's phonology handles them as fricatives, being paired with /ɣ/ and /vʲ/ respectively.

/r rʲ/ surface as trills or taps, in free variation. The trills are more typical in simple onsets, while the taps in complex onsets and coda.

The contrast between /m n/ is neutralised when preceding another consonant in the same word, since both can surface as [m n ŋ]; ditto for /mʲ nʲ/ surfacing as [mʲ ɱʲ ɲ].

Coda /g/ can also surface as [ŋ], but only in word final position; as such, it doesn't merge with the above.

Liquids clustered with voiceless fricatives and/or stops have voiceless allophones.

Vowels

Proto-Sitama's vowel system is a simple square: /æ i ɒ u/. They have a wide range of allophones, with three situations being noteworthy:

  • /ɒ u/ are typically fronted to [Œ ʉ] after a soft consonant
  • /æ i/ are backed to [ɐ ɪ] after a hard velar
  • unstressed vowels are slightly centralised

Accent

Accent surfaces as stress, and it's dictated by the following rules:

  1. Some suffixes have an intrinsic stress. If the word has 1+ of those, then assign the primary stress to the last one. Else, assign it to the last syllable of the root.
  2. If the primary stress fell on the 5th/7th/9th/etc.-to-last syllable, move it to the 3rd-to-last
  3. If the primary stress fell on the 4th/6th/8th/etc.-to-last syllable, move it to the 2nd-to-last.
  4. Every two syllables, counting from the one with the primary stress, add a secondary stress.

Phonotactics

Max syllable is CCVCC, with the following restrictions:

  • complex onset: [stop] + [liquid]; e.g. /pl/ is a valid onset, */pw/ isn't
  • complex coda: [liquid or nasal] + [stop or fricative]; e.g. /nz/ is a valid coda, */dz/ isn't

If morphology would create a syllable violating such structure, an epenthetic /i/ dissolves the cluster.

Consonant clusters cannot mix hard and soft consonants. When such a mix would be required by the morphology, the last consonant dictates if the whole cluster should be soft or hard, and other consonants are mutated into their counterparts from the other set. For example, */lpʲ/ and */ʃp/ would be mutated to /ʎpʲ/ and /sp/.

Stops and fricatives clustered together cannot mix voice. Similar to the above, the last consonant of the cluster dictates the voicing of the rest; e.g. */dk/ and */pz/ would be converted into /tk/ and /bz/ respectively.

Gemination is not allowed, and two identical consonants next to each other are simplified into a singleton. Nasal consonants are also forbidden from appearing next to each other, although a cluster like /nt.m/ would be still valid.

Word-internal hiatuses are dissolved with an epenthetic /z/. Between words most speakers use a non-phonemic [ʔ], but some use [z] even in word boundaries.

Romanisation

As mentioned at the start, the people who spoke Cjermizást didn't write their own language. As such the romanisation here is solely a convenience.

  • /m n p t b d g s x w z l r/ are romanised as in IPA
  • /k ɸ ɣ/ are romanised ⟨c f y⟩
  • "soft" consonants are romanised as their "hard" counterparts, plus ⟨j⟩
  • ⟨j⟩ is omitted inside clusters; e.g. /pʲʎ/ is romanised as ⟨plj⟩, not as *⟨pjlj⟩
  • /æ i ɒ u/ are ⟨e i a u⟩
 

Use this thread to ask questions or share trivia, if you don't want to create a new thread for that.

[Note: the purpose of this thread is to promote activity, not to concentrate it. So if you'd still rather post a new thread, by all means - go for it!]

 

Quick summary: a tablet written in Hittite, from a likely vassal to their king, recounts how Attaršiya [Atreus?] of Ahhiyawa [the Achaeans] and his sons attacked Taruiša [Troy]. And at the end there's a fragment in another Anatolian language, Luwian, saying the following:

wa-ar-ku-uš-ša-an ma-a-aš-ša-ni SÌ[R
wrath.ACC god(dess).VOC? si[ng

So roughly "Sing, oh goddess, the wrath..."

This is pretty much how the Illiad starts in Greek:

μῆνιν ἄειδε θεὰ Πηληϊάδεω Ἀχιλῆος
mênĭn áeide theā́ Pēlēïádeō Akhĭlêos
rage.ACC sing.IMP goddess.VOC Peleus.GEN Achilles.GEN
Sing, oh goddess, the rage of Achilles [son] of Peleus

 

Here's a direct link to the journal article.

Summary: phylogenomic study found that Hexapoda (insects, springtails, headcones) is a sister clade to Remipedia (venomous, cave-dwelling "crustaceans"). So it's basically the same that happened with birds and dinos, except with bugs.

15
submitted 4 months ago* (last edited 4 months ago) by lvxferre@mander.xyz to c/linguistics@mander.xyz
 

Feel free to use this thread to ask small questions or share random language / linguistics trivia, if you don't feel like creating a new thread just for that.

(Just to be clear: yes, if you want to create a new thread for your question/trivia, you can. I'm only trying to stimulate discussion in the comm.)

view more: next ›