eupolicy.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
This Mastodon server is a friendly and respectful discussion space for people working in areas related to EU policy. When you request to create an account, please tell us something about you.

Server stats:

211
active users

#speechrecognition

0 posts0 participants0 posts today
Continued thread

"#KarenHao only really gets her teeth into this point in the book’s epilogue, “How the Empire Falls.” She takes inspiration from #TeHiku, a #Māori AI #speechrecognition project. Te Hiku seeks to revitalize the #te_reo language through putting archived audio tapes of te reo speakers into an AI model, teaching new generations of Māori.
The tech has been developed on consent and active participation from the Māori community, and it is only licensed to organizations that respect Māori values"

Replied in thread

@thelinuxEXP I really like Speech Note! It's a fantastic tool for quick and local voice transcription in multiple languages, created by @mkiol

It's incredibly handy for capturing thoughts on the go, conducting interviews, or making voice memos without worrying about language barriers. The app uses strictly locally running LLMs, and its ease of use makes it a standout choice for anyone needing offline transcription services.

I primarily use #WhisperAI for transcription and Piper for voice, but many other models are available as well.

It is available as flatpak and github.com/mkiol/dsnote

#TTS #transcription #TextToSpeech #translator translation #offline #machinetranslation #sailfishos #SpeechSynthesis #SpeechRecognition #speechtotext #nmt #linux-desktop #stt #asr #flatpak-applications #SpeechNote

🌟 Excited to share Thorsten-Voice's YouTube channel! 🎥 🗣️🔊 ♿ 💬

Thorsten presents innovative TTS solutions and a variety of voice technologies, making it an excellent starting point for anyone interested in open-source text-to-speech. Whether you're a developer, accessibility advocate, or tech enthusiast, his channel offers valuable insights and resources. Don't miss out on this fantastic content! 🎬

follow hem here: @thorstenvoice
or on YouTube: youtube.com/@ThorstenMueller YouTube channel!

www.youtube.comBefore you continue to YouTube

#UnplugTrump - Tipp5:
Verabschiede dich von Alexa und anderen Sprachassistenten, die deine Gespräche mithören und auswerten. Nutze stattdessen eine datenschutzfreundliche Alternative wie OpenVoiceOS, ein Open-Source-Sprachassistent, der von einer aktiven Community weiterentwickelt wird und auf einem RaspberryPi läuft. So behältst du die Kontrolle über deine Daten.

Hey folks :FediverseSymbol:

We've actually done an unwritten, off-the-cusp trans voice Friday recording today :TransHeart:

We've not listened back to it, because voice dysphoria, but we've added full alt text.

In case you're wondering how we've done that without listening back to it, we've once against used an amazing tool called Subtitle Edit, which has audio to text functionality via the Whisper speech recognition engine.

We used the large-v3 model, which is about 3.1 GB, but gives incredibly accurate transcription.

In case anyone can't access the alt text, we've added the full transcript below too.

#TransVoiceFriday #TransVoice #voice #VoiceFeminisation #VoiceFeminization #VoiceTraining #trans #transgender #TransFem #VoiceDysphoria #SubtitleEdit #PurfviewWhisper #AudioToText #SpeechToText #SpeechRecognition

Hey folks, I know that we haven't done a voice note in forever, and that's been for a multitude of reasons, some of which are related to mental health, some of which are related to work, stress, anxiety, depression, etc, things like that, which comes under mental health anyway, yeah, partly due to poor time management, yay for being AuDHD! But not gonna lie, some of it does come down to underlying voice dysphoria, because this is the best we've managed to get since December 2021. And just for anyone who hasn't heard roughly what we sounded like beforehand, we haven't exactly moved our voice up a lot. I mean, the base level would just be down here. So I can move my voice back up here easily now, and this is the comfortable, this is the default voice. But, um... It's not where I want it to be, it's not in the female range, and I can't easily push the pitch up higher without it sounding wrong. But yeah, there's been a lot of stuff going on recently, um, a lot of bad stuff for everyone, don't want to talk about all of that. But, um, let's just focus on supporting each other, helping each other, um, being kind to ourselves and others right now, and being compassionate and empathetic. That's all I've really got to say. I'm trying to do the same thing with ourselves, but yeah, it's hard sometimes. Anyway, ta-ta for now.

Gibt es aktuell eine gut funktionierende und anwendungsfreundliche Möglichkeit, Text direkt in ein #Libreoffice-Dokument zu #diktieren?
Lokale Lösungen! Keinesfalls Cloud.

Ich kenne den Weg, eine Audio-Datei mit #Whisper zu transkribieren. Das ist super, nutzt mir aber aktuell gerade recht wenig. Ich bräuchte was, wo der Prozess schon während des Sprechens läuft ...so wie bei Dragon und ähnlichen "Diktiersystemen", die es mal gab (und von denen ich nicht weiß, ob sie noch existieren).

After recently voicing my frustration with how much effort it is for me to type in #AltText while mobile and away from my desktop PC, I am wondering whether using #SpeechRecognition with a #Bluetooth headset might be a viable solution.

But I don't have much experience with either. Thus:

What kind of _lightweight_ Bluetooth headset could you recommend which would easily fit into the pocket of my jacket and the like (I like to travel lightly when doing bicycle trips)?

And what kind of speech-to-text app would work well when entering Alt-Text into my Mastodon app (currently using Tusky, but I am willing to change if that's what it takes)? Ideally it should support both English and German.

#ArtificialIntelligence: #feilnerism

We are using a term that we cannot define ("intelligence") for marketing by adding an adjective to make it sound more sophisticated and technical with the only intention of selling, making money out of it. Today, everything that is "magic" (see Clarke) has become "#AI": #LLM, #PatternRecognition, #MachineLearning, #Cloud, #SpeechRecognition, #Assistants, ... all the stuff from the last decades. What is really new about it?

Reposting my #introduction after the SDF database crash—

Hi everybody! I'm Will and I enjoy all things #languages and #code. My day job is in #nlp (natural language processing #nlproc) and #speechrecognition for language education. In grad school I worked on #Bayesian #pragmatics with #deeplearning.

I speak English natively, #español #Spanish / #中文 #Chinese / #العربية #Arabic passably, and lots of others poorly. Talk to me in your language!