mastodon.gamedev.place is one of the many independent Mastodon servers you can use to participate in the fediverse.
Mastodon server focused on game development and related topics.

Server stats:

5.1K
active users

#albanian

0 posts0 participants0 posts today

Realities We Share - Evaluation launch #London #Event "In the shadows of the Illegal #Migration Act and the #Rwanda project, seeing an increase in #hostile policy, practice and rhetoric targeting #asylum seekers, and in the context of a crisis in the provision of legal aid #representation across the #immigration and asylum sector, a small project, Breaking the Chains, sought to improve outcomes for #Albanian #children and young people seeking asylum." Fri, 27 Sep eventbrite.co.uk/e/realities-w

EventbriteRealities We ShareEvaluation launch

Each quarter, when the new @mozilla #CommonVoice #dataset is released, I do a #dataviz using @observablehq of its #metadata coverage, across all 100+ languages, based on the JSON summary that is part of the release.

Some of my observations from the v18 release are:

💡 #Catalan (ca) now has a larger dataset than English, based on the number of audio recordings (including validated and yet-to-be-validated recordings). It’s also an interesting dataset because the number of recordings per unique contributor is relatively low (around 80). This means it’s likely to have a high diversity of speakers in the dataset, which is useful for building #ASR models that generalise well to many speakers.

Catalan also appears to have the highest percentage of audio recordings by older speakers - e.g. speakers in their forties, fifties and older. Again, this highlights the diversity of speakers in the Catalan dataset.

💡 Although it’s very early to see any trends from the decision by Common Voice to expand the range of options for gender identity, we are starting to see some data being tagged with the new options that are available. For example, in #Uyghur (ug), we now have data tagged as “do not wish to say”. I don’t want to draw connections between the geopolitical situation in that area and the desire of data contributors not to provide demographic data which may in some way identify them without more evidence, but I think it’s telling that the first use of these expanded metadata categories appears in a language that is spoken in a contested geography.

💡Similarly, it’s very early to identify trends in sentence domain classification - as most of the sentences that do have a domain tag are labelled “general”, although “health_care” sentences are occurring frequently in languages such as #Albanian (sq).

💡#Bangla (Bengali) (bn) continues to have a very large number of yet-to-be-validated audio recordings. Due to this, the train split for Bangla is quite small.

💡#Dholuo (luo), a language spoken in Kenya and Tanzania, is an outlier in terms of the number of distinct data contributors to the dataset - this language has a very high average number of contributions for per contributor. This is often seen in languages that are new to Common Voice, before they have been able to recruit more contributors. Dholuo has nearly 5 million speakers.

💡 The language with the highest average utterance duration is by far #Icelandic (is) at over 7 seconds. This may be because Icelandic has many words with several syllables, which take longer to pronounce. Consider "the cat sat on the mat" in English, cf "kötturinn sat á mottunni" in Icelandic.

Big thanks to all data contributors in this release for your donated utterances, and to Dmitrij Feller, @jessie, Gina Moape, EM Lewis-Jong and the team for all your efforts.

What are your thoughts? What conclusions do you draw?

observablehq.com/@kathyreid/mo

Audio Etymologies of the Day

“Wear” comes from Proto-Indo-European *wos, like this (listen):
🔈ancientsounds.net/eastern-orig

*wos is a form of *wes, which also developed into Sanskrit वस्ते vaste:
🔈ancientsounds.net/eastern-orig

and Albanian vesh:
🔈ancientsounds.net/eastern-orig

and Latin vestis:
🔈ancientsounds.net/eastern-orig

which was borrowed into English as "vest" and in "vestments".

an #introduction !
Apparently people do that in the fediverse..

=========

Andromeda
21+ y/o
leftist liberal socialist
🏳️‍🌈 Queer : Enby, Ace and Sapphic
🇦🇱 Albanian
🚫 Ex-muslim
🧠 Probably™️ neurotypical
🎮 I like video games.
✊ Punch Nazis and ACAB.
🐧 VR and Linux enthusiast (not good at it tho)
🌌 Space enthusiast

Topics you probably shouldn't talk to me about:
🍎 Apple
Religion, specifically Islam


Some tags that fit me:
#3dmodelling #blender #queer #albanian #politics #socialism #leftist #atheist #linux #antifa #asexual #nerd #virtualreality

So uhh hi I guess. I'm a tad shy and I'm just here to socialize.

Used to be right leaning and I think it's important that people are aware. Used to be bigoted and now proudly stand here. Disowned my own family after I realized I was indoctrinated with Islamic bullshit.