**Ecologia Digital** @josemurilo@mato.social · Jul 8

"#KarenHao only really gets her teeth into this point in the book’s epilogue, “How the Empire Falls.” She takes inspiration from #TeHiku, a #Māori AI #speechrecognition project. Te Hiku seeks to revitalize the #te_reo language through putting archived audio tapes of te reo speakers into an AI model, teaching new generations of Māori.
The tech has been developed on consent and active participation from the Māori community, and it is only licensed to organizations that respect Māori values"

**Jeremy Kahn** @trochee@dair-community.social · Jul 4

I don't know why they call it vibe coding

**Debby** @debby@hear-me.social · Jul 3 *

@thelinuxEXP I really like Speech Note! It's a fantastic tool for quick and local voice transcription in multiple languages, created by @mkiol

It's incredibly handy for capturing thoughts on the go, conducting interviews, or making voice memos without worrying about language barriers. The app uses strictly locally running LLMs, and its ease of use makes it a standout choice for anyone needing offline transcription services.

I primarily use #WhisperAI for transcription and Piper for voice, but many other models are available as well.

It is available as flatpak and https://github.com/mkiol/dsnote

#TTS #transcription #TextToSpeech #translator #translation #offline #machinetranslation #sailfishos #SpeechSynthesis #SpeechRecognition #speechtotext #nmt #linux-desktop #stt #asr #flatpak-applications #SpeechNote

**PLOS Biology** @PLOSBiology@fediscience.org · Jun 17

Slow amplitude fluctuations in sounds, critical for #SpeechRecognition, seem poorly represented in the #brainstem. This study shows that overlooked intricacies of #SpikeTiming represent these fluctuations, reconciling low-level neural processing with #perception @plosbiology.org https://plos.io/3FJ4adI

**Debby** @debby@hear-me.social · May 23 *

Excited to share Thorsten-Voice's YouTube channel!

Thorsten presents innovative TTS solutions and a variety of voice technologies, making it an excellent starting point for anyone interested in open-source text-to-speech. Whether you're a developer, accessibility advocate, or tech enthusiast, his channel offers valuable insights and resources. Don't miss out on this fantastic content!

Follow him here: @thorstenvoice
or on YouTube: https://www.youtube.com/@ThorstenMueller

#Accessibility #FLOSS #TTS

**Debby** @debby@hear-me.social · May 23 *

Goode @thorstenvoice, just found your channel and I'm impressed! Your work on TTS is fantastic and so important for accessibility in the FLOSS community. Keep it up! #AccessibilityMatters #FLOSS #TTS #OpenSource #Inclusivity #FOSS #Coqui #AI #CoquiAI #VoiceAssistant #Sprachassistent #VoiceTechnology #KünstlicheStimme #MachineLearning #Python #Rhasspy #TextToSpeech #VoiceTech #STT #SpeechSynthesis #SpeechRecognition #Sprachsynthese #ArtificialVoice #VoiceCloning #Spracherkennung #CoquiTTS #voice #a11y #ScreenReader

**IT News** @itnewsbot@schleuss.online · May 18

Christmas Comes Early With AI Santa Demo - With only two hundred odd days ’til Christmas, you just know we’re already feeling... - https://hackaday.com/2025/05/18/christmas-comes-early-with-ai-santa-demo/ #artificialintelligence #speechrecognition #speechsynthesis #santaclaus #libpeer #openai #llm #ai

Hackaday · May 18
Christmas Comes Early With AI Santa Demo
With only two hundred odd days ’til Christmas, you just know we’re already feeling the season’s magic. Well, maybe not, but [Sean Dubois] has decided to give us a head start with …

**Doug Holton** @dougholton@mastodon.social · Feb 10 *

Vibe is an #OpenSource desktop client (macOS, Windows, Linux) for running Whisper locally to more accurately transcribe or caption videos & audio https://thewh1teagle.github.io/vibe/ Source code: https://github.com/thewh1teagle/vibe/ Easier to use than what I was using before (WhisperDesktop). Default settings use the medium Whisper model, which has been good enough in my experience.
#Accessibility #A11y #AI #SpeechRecognition #EdTech

**The Conversation U.S.** @TheConversationUS@newsie.social · Feb 5

Speech recognition systems struggle with accents and dialects, risking problems in critical fields like healthcare and emergency services. Imagine calling 911 and the AI used to screen out non-emergency calls can’t understand you.

A Spanish language professor explains: https://theconversation.com/sorry-i-didnt-get-that-ai-misunderstands-some-peoples-words-more-than-others-239281 #AI #speechrecognition

The Conversation
‘Sorry, I didn’t get that’: AI misunderstands some people’s words more than others
Speaking with an AI bot can be amusing and even helpful – if it understands you. How well AIs do that is a matter of whose speech they’ve been trained on.

**Kuketz-Blog** @kuketzblog@social.tchncs.de · Feb 5

#UnplugTrump - Tip 5:
Say goodbye to Alexa and other voice assistants that listen in on and analyze your conversations. Instead, use a privacy-friendly alternative such as OpenVoiceOS, an open-source voice assistant that is developed by an active community and runs on a Raspberry Pi. That way you keep control of your data.

#Alexa #OpenVoiceOS #Sprachassistent

**Mac** @macst3r@mastodon.social · Nov 26, 2024

Browsing with #speechRecognition

https://www.youtube.com/watch?v=iEiSez9F79o

YouTube
Browsing with speech recognition (British Sign Language version)
By TetraLogical

#accessibility #browser #web

**SleepyCatten** @SleepyCatten@cultofshiv.wtf · Nov 8, 2024

Hey folks

We've actually done an unwritten, off-the-cuff trans voice Friday recording today

We've not listened back to it, because voice dysphoria, but we've added full alt text.

In case you're wondering how we've done that without listening back to it, we've once again used an amazing tool called Subtitle Edit, which has audio-to-text functionality via the Whisper speech recognition engine.

We used the large-v3 model, which is about 3.1 GB, but gives incredibly accurate transcription.

In case anyone can't access the alt text, we've added the full transcript below too.

#TransVoiceFriday #TransVoice #voice #VoiceFeminisation #VoiceFeminization #VoiceTraining #trans #transgender #TransFem #VoiceDysphoria #SubtitleEdit #PurfviewWhisper #AudioToText #SpeechToText #SpeechRecognition

Hey folks, I know that we haven't done a voice note in forever, and that's been for a multitude of reasons, some of which are related to mental health, some of which are related to work, stress, anxiety, depression, etc, things like that, which comes under mental health anyway, yeah, partly due to poor time management, yay for being AuDHD! But not gonna lie, some of it does come down to underlying voice dysphoria, because this is the best we've managed to get since December 2021. And just for anyone who hasn't heard roughly what we sounded like beforehand, we haven't exactly moved our voice up a lot. I mean, the base level would just be down here. So I can move my voice back up here easily now, and this is the comfortable, this is the default voice. But, um... It's not where I want it to be, it's not in the female range, and I can't easily push the pitch up higher without it sounding wrong. But yeah, there's been a lot of stuff going on recently, um, a lot of bad stuff for everyone, don't want to talk about all of that. But, um, let's just focus on supporting each other, helping each other, um, being kind to ourselves and others right now, and being compassionate and empathetic. That's all I've really got to say. I'm trying to do the same thing with ourselves, but yeah, it's hard sometimes. Anyway, ta-ta for now.

**Kathy Reid** @KathyReid@aus.social · Sep 10, 2024

Absolutely the fuck not.

#SpeechRecognition #Advertising #Surveillance

https://therecord.media/ford-patent-application-in-vehicle-listening-advertising

therecord.media
Ford seeks patent for tech that listens to driver conversations to serve ads
A Ford Motor Company patent application filed in February and published last month proposes software that would monitor in-car conversations and other data to help serve up advertisements.

**Thorsten Rochelmeyer** @thorsten4future@climatejustice.social · Sep 4, 2024

Is there currently a well-functioning, user-friendly way to dictate (#diktieren) text directly into a #Libreoffice document?
Local solutions! Absolutely no cloud.

I know the route of transcribing an audio file with #Whisper. That's great, but it is of little use to me right now. I need something where the process already runs while I am speaking ...like Dragon and similar "dictation systems" that used to exist (and I don't know whether they still do).

#Linux #STT #SpeechToText

**Jürgen Hubert** @juergen_hubert@thefolklore.cafe · Jun 3, 2024

After recently voicing my frustration with how much effort it is for me to type in #AltText while mobile and away from my desktop PC, I am wondering whether using #SpeechRecognition with a #Bluetooth headset might be a viable solution.

But I don't have much experience with either. Thus:

What kind of _lightweight_ Bluetooth headset could you recommend which would easily fit into the pocket of my jacket and the like (I like to travel lightly when doing bicycle trips)?

And what kind of speech-to-text app would work well when entering Alt-Text into my Mastodon app (currently using Tusky, but I am willing to change if that's what it takes)? Ideally it should support both English and German.

**Feilner IT** @FeilnerIT@mastodon.social · May 19, 2024

#ArtificialIntelligence: #feilnerism

We are using a term that we cannot define ("intelligence") for marketing by adding an adjective to make it sound more sophisticated and technical with the only intention of selling, making money out of it. Today, everything that is "magic" (see Clarke) has become "#AI": #LLM, #PatternRecognition, #MachineLearning, #Cloud, #SpeechRecognition, #Assistants, ... all the stuff from the last decades. What is really new about it?

**Will M** @futurulus@mastodon.sdf.org · Dec 13, 2022

Reposting my #introduction after the SDF database crash—

Hi everybody! I'm Will and I enjoy all things #languages and #code. My day job is in #nlp (natural language processing #nlproc) and #speechrecognition for language education. In grad school I worked on #Bayesian #pragmatics with #deeplearning.

I speak English natively, #español #Spanish / #中文 #Chinese / #العربية #Arabic passably, and lots of others poorly. Talk to me in your language!
