eupolicy.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
This Mastodon server is a friendly and respectful discussion space for people working in areas related to EU policy. When you request to create an account, please tell us something about you.

Server stats:

205
active users

#speechrecognition

1 post1 participant0 posts today
Jürgen Hubert<p>Tech question: Is there a <a href="https://mementomori.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a> software that:</p><p>1) works for generating <a href="https://mementomori.social/tags/video" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>video</span></a> captions (I would like to use them for <a href="https://mementomori.social/tags/PeerTube" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>PeerTube</span></a> videos)<br>2) is trainable<br>3) is <a href="https://mementomori.social/tags/FOSS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FOSS</span></a> ?</p>
adrienandrem<p>Quel outil de Reconnaissance automatique de la parole (speech to text) respectueux de la vie privée et hors-ligne recommandez-vous ?<br><a href="https://pouet.chapril.org/tags/ReconnaissanceAutomatiqueDeLaParole" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ReconnaissanceAutomatiqueDeLaParole</span></a> <a href="https://pouet.chapril.org/tags/speechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechRecognition</span></a></p>
Joseph<p>Another failed attempt at rolling out <a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> into the real-world: <a href="https://www.wsj.com/articles/taco-bell-rethinks-future-of-voice-ai-at-the-drive-through-72990b5a" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">wsj.com/articles/taco-bell-ret</span><span class="invisible">hinks-future-of-voice-ai-at-the-drive-through-72990b5a</span></a></p><p>One might actually think that ordering from a drive-through is a very constrained and well-defined problem, such that it would lend itself easily to <a href="https://mastodon.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a> or so.</p><p>But even the fast food world turns out to be more open and complex than current-gen AI can handle.</p>
AskUbuntu<p>Speech to Text extension for real-time transcription in your browser — any good ones? <a href="https://ubuntu.social/tags/speechrecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechrecognition</span></a></p><p><a href="https://askubuntu.com/q/1554138/612" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">askubuntu.com/q/1554138/612</span><span class="invisible"></span></a></p>
Niavy :verified: :bearn:<p>Il y a quelque temps, dans un fil sur les apps open source, je me suis fait rembarrer en parlant de Whisper comme alternative à la synthèse vocale Google, au motif que ça appartiendrait à OpenAI. Ça vous dit quelque chose ?</p><p>Pour le moment j'utilise avec plaisir SherpaTTS, mais bon, je suis curieux.</p><p>En plus, la description de Whisper+ semble indiquer que l'application est capable de traduire à la volée de la langue parlée vers l'anglais ! <span class="h-card" translate="no"><a href="https://masto.bike/@Vive_Levant" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>Vive_Levant</span></a></span> </p><p><a href="https://masto.bike/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a><br><a href="https://masto.bike/tags/OpenAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenAI</span></a><br><a href="https://masto.bike/tags/TTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TTS</span></a><br><a href="https://masto.bike/tags/FOSS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FOSS</span></a></p>
Data Quine<p>Journal of Open Source Software: voice: A Comprehensive R Package for Audio Analysis <br>{voice}<br>"...a free, open-source toolkit designed to streamline audio analysis by integrating music theory and advanced computational techniques. It enables researchers to extract, summarize, and analyze voice data efficiently, supporting applications such as speech recognition, speaker identification, and mood inference..."</p><p><a href="https://joss.theoj.org/papers/10.21105/joss.08420" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">joss.theoj.org/papers/10.21105</span><span class="invisible">/joss.08420</span></a></p><p><a href="https://datasci.social/tags/RStats" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>RStats</span></a> <a href="https://datasci.social/tags/Audio" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Audio</span></a> <a href="https://datasci.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://datasci.social/tags/AudioAnalysis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AudioAnalysis</span></a> <a href="https://datasci.social/tags/Speech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Speech</span></a></p>
Ecologia Digital<p>"<a href="https://mato.social/tags/KarenHao" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>KarenHao</span></a> only really gets her teeth into this point in the book’s epilogue, “How the Empire Falls.” She takes inspiration from <a href="https://mato.social/tags/TeHiku" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TeHiku</span></a>, a <a href="https://mato.social/tags/M%C4%81ori" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Māori</span></a> AI <a href="https://mato.social/tags/speechrecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechrecognition</span></a> project. Te Hiku seeks to revitalize the <a href="https://mato.social/tags/te_reo" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>te_reo</span></a> language through putting archived audio tapes of te reo speakers into an AI model, teaching new generations of Māori.<br>The tech has been developed on consent and active participation from the Māori community, and it is only licensed to organizations that respect Māori values"</p>
Jeremy KahnI don't know why they call it vibe coding
Debby ‬⁂📎🐧:disability_flag:<p><span class="h-card" translate="no"><a href="https://mastodon.social/@thelinuxEXP" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>thelinuxEXP</span></a></span> I really like Speech Note! It's a fantastic tool for quick and local voice transcription in multiple languages, created by <span class="h-card" translate="no"><a href="https://mastodon.social/@mkiol" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>mkiol</span></a></span> </p><p>It's incredibly handy for capturing thoughts on the go, conducting interviews, or making voice memos without worrying about language barriers. The app uses strictly locally running LLMs, and its ease of use makes it a standout choice for anyone needing offline transcription services.</p><p>I primarily use <a href="https://hear-me.social/tags/WhisperAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>WhisperAI</span></a> for transcription and Piper for voice, but many other models are available as well. </p><p>It is available as flatpak and <a href="https://github.com/mkiol/dsnote" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">github.com/mkiol/dsnote</span><span class="invisible"></span></a> </p><p><a href="https://hear-me.social/tags/TTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TTS</span></a> <a href="https://hear-me.social/tags/transcription" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>transcription</span></a> <a href="https://hear-me.social/tags/TextToSpeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TextToSpeech</span></a> <a href="https://hear-me.social/tags/translator" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>translator</span></a> translation <a href="https://hear-me.social/tags/offline" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>offline</span></a> <a href="https://hear-me.social/tags/machinetranslation" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>machinetranslation</span></a> <a href="https://hear-me.social/tags/sailfishos" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>sailfishos</span></a> <a href="https://hear-me.social/tags/SpeechSynthesis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechSynthesis</span></a> <a href="https://hear-me.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://hear-me.social/tags/speechtotext" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechtotext</span></a> <a href="https://hear-me.social/tags/nmt" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>nmt</span></a> <a href="https://hear-me.social/tags/linux" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>linux</span></a>-desktop <a href="https://hear-me.social/tags/stt" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>stt</span></a> <a href="https://hear-me.social/tags/asr" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>asr</span></a> <a href="https://hear-me.social/tags/flatpak" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>flatpak</span></a>-applications <a href="https://hear-me.social/tags/SpeechNote" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechNote</span></a></p>
PLOS Biology<p>Slow amplitude fluctuations in sounds, critical for <a href="https://fediscience.org/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a>, seem poorly represented in the <a href="https://fediscience.org/tags/brainstem" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>brainstem</span></a>. This study shows that overlooked intricacies of <a href="https://fediscience.org/tags/SpikeTiming" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpikeTiming</span></a> represent these fluctuations, reconciling low-level neural processing with <a href="https://fediscience.org/tags/perception" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>perception</span></a> @plosbiology.org 🧪 <a href="https://plos.io/3FJ4adI" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">plos.io/3FJ4adI</span><span class="invisible"></span></a></p>
Debby ‬⁂📎🐧:disability_flag:<p>🌟 Excited to share Thorsten-Voice's YouTube channel! 🎥 🗣️🔊 ♿ 💬</p><p>Thorsten presents innovative TTS solutions and a variety of voice technologies, making it an excellent starting point for anyone interested in open-source text-to-speech. Whether you're a developer, accessibility advocate, or tech enthusiast, his channel offers valuable insights and resources. Don't miss out on this fantastic content! 🎬</p><p>follow hem here: <span class="h-card" translate="no"><a href="https://techhub.social/@thorstenvoice" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>thorstenvoice</span></a></span> <br>or on YouTube: <a href="https://www.youtube.com/@ThorstenMueller" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="">youtube.com/@ThorstenMueller</span><span class="invisible"></span></a> YouTube channel! </p><p><a href="https://hear-me.social/tags/Accessibility" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Accessibility</span></a> <a href="https://hear-me.social/tags/FLOSS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FLOSS</span></a> <a href="https://hear-me.social/tags/TTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TTS</span></a> <a href="https://hear-me.social/tags/ParlerTTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ParlerTTS</span></a> <a href="https://hear-me.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSource</span></a> <a href="https://hear-me.social/tags/VoiceTech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceTech</span></a> <a href="https://hear-me.social/tags/TextToSpeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TextToSpeech</span></a> <a href="https://hear-me.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://hear-me.social/tags/CoquiAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>CoquiAI</span></a> <a href="https://hear-me.social/tags/VoiceAssistant" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceAssistant</span></a> <a href="https://hear-me.social/tags/Sprachassistent" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Sprachassistent</span></a> <a href="https://hear-me.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MachineLearning</span></a> <a href="https://hear-me.social/tags/AccessibilityMatters" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AccessibilityMatters</span></a> <a href="https://hear-me.social/tags/FLOSS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FLOSS</span></a> <a href="https://hear-me.social/tags/TTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TTS</span></a> <a href="https://hear-me.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSource</span></a> <a href="https://hear-me.social/tags/Inclusivity" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Inclusivity</span></a> <a href="https://hear-me.social/tags/FOSS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FOSS</span></a> <a href="https://hear-me.social/tags/Coqui" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Coqui</span></a> <a href="https://hear-me.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://hear-me.social/tags/CoquiAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>CoquiAI</span></a> <a href="https://hear-me.social/tags/VoiceAssistant" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceAssistant</span></a> <a href="https://hear-me.social/tags/Sprachassistent" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Sprachassistent</span></a> <a href="https://hear-me.social/tags/VoiceTechnology" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceTechnology</span></a> <a href="https://hear-me.social/tags/K%C3%BCnstlicheStimme" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>KünstlicheStimme</span></a> <a href="https://hear-me.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MachineLearning</span></a> <a href="https://hear-me.social/tags/Python" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Python</span></a> <a href="https://hear-me.social/tags/Rhasspy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Rhasspy</span></a> <a href="https://hear-me.social/tags/TextToSpeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TextToSpeech</span></a> <a href="https://hear-me.social/tags/VoiceTech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceTech</span></a> <a href="https://hear-me.social/tags/STT" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>STT</span></a> <a href="https://hear-me.social/tags/SpeechSynthesis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechSynthesis</span></a> <a href="https://hear-me.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://hear-me.social/tags/Sprachsynthese" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Sprachsynthese</span></a> <a href="https://hear-me.social/tags/ArtificialVoice" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ArtificialVoice</span></a> <a href="https://hear-me.social/tags/VoiceCloning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceCloning</span></a> <a href="https://hear-me.social/tags/Spracherkennung" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Spracherkennung</span></a> <a href="https://hear-me.social/tags/CoquiTTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>CoquiTTS</span></a> <a href="https://hear-me.social/tags/voice" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>voice</span></a> <a href="https://hear-me.social/tags/a11y" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>a11y</span></a> <a href="https://hear-me.social/tags/ScreenReader" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ScreenReader</span></a></p>
Debby ‬⁂📎🐧:disability_flag:<p>Goode <span class="h-card" translate="no"><a href="https://techhub.social/@thorstenvoice" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>thorstenvoice</span></a></span>, just found your channel and I'm impressed! Your work on TTS is fantastic and so important for accessibility in the FLOSS community. Keep it up! <a href="https://hear-me.social/tags/AccessibilityMatters" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AccessibilityMatters</span></a> <a href="https://hear-me.social/tags/FLOSS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FLOSS</span></a> <a href="https://hear-me.social/tags/TTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TTS</span></a> <a href="https://hear-me.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSource</span></a> <a href="https://hear-me.social/tags/Inclusivity" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Inclusivity</span></a> <a href="https://hear-me.social/tags/FOSS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FOSS</span></a> <a href="https://hear-me.social/tags/Coqui" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Coqui</span></a> <a href="https://hear-me.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://hear-me.social/tags/CoquiAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>CoquiAI</span></a> <a href="https://hear-me.social/tags/VoiceAssistant" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceAssistant</span></a> <a href="https://hear-me.social/tags/Sprachassistent" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Sprachassistent</span></a> <a href="https://hear-me.social/tags/VoiceTechnology" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceTechnology</span></a> <a href="https://hear-me.social/tags/K%C3%BCnstlicheStimme" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>KünstlicheStimme</span></a> <a href="https://hear-me.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MachineLearning</span></a> <a href="https://hear-me.social/tags/Python" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Python</span></a> <a href="https://hear-me.social/tags/Rhasspy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Rhasspy</span></a> <a href="https://hear-me.social/tags/TextToSpeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TextToSpeech</span></a> <a href="https://hear-me.social/tags/VoiceTech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceTech</span></a> <a href="https://hear-me.social/tags/STT" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>STT</span></a> <a href="https://hear-me.social/tags/SpeechSynthesis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechSynthesis</span></a> <a href="https://hear-me.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://hear-me.social/tags/Sprachsynthese" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Sprachsynthese</span></a> <a href="https://hear-me.social/tags/ArtificialVoice" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ArtificialVoice</span></a> <a href="https://hear-me.social/tags/VoiceCloning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceCloning</span></a> <a href="https://hear-me.social/tags/Spracherkennung" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Spracherkennung</span></a> <a href="https://hear-me.social/tags/CoquiTTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>CoquiTTS</span></a> <a href="https://hear-me.social/tags/voice" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>voice</span></a> <a href="https://hear-me.social/tags/a11y" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>a11y</span></a> <a href="https://hear-me.social/tags/ScreenReader" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ScreenReader</span></a></p>
IT News<p>Christmas Comes Early With AI Santa Demo - With only two hundred odd days ’til Christmas, you just know we’re already feeling... - <a href="https://hackaday.com/2025/05/18/christmas-comes-early-with-ai-santa-demo/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">hackaday.com/2025/05/18/christ</span><span class="invisible">mas-comes-early-with-ai-santa-demo/</span></a> <a href="https://schleuss.online/tags/artificialintelligence" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>artificialintelligence</span></a> <a href="https://schleuss.online/tags/speechrecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechrecognition</span></a> <a href="https://schleuss.online/tags/speechsynthesis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechsynthesis</span></a> <a href="https://schleuss.online/tags/santaclaus" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>santaclaus</span></a> <a href="https://schleuss.online/tags/libpeer" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>libpeer</span></a> <a href="https://schleuss.online/tags/openai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>openai</span></a> <a href="https://schleuss.online/tags/llm" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>llm</span></a> <a href="https://schleuss.online/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ai</span></a></p>
Doug Holton<p>Vibe is an <a href="https://mastodon.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSource</span></a> desktop client (mac, windows, linux) for locally running Whisper to more accurately transcribe or caption videos &amp; audio <a href="https://thewh1teagle.github.io/vibe/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">thewh1teagle.github.io/vibe/</span><span class="invisible"></span></a> Source code: <a href="https://github.com/thewh1teagle/vibe/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">github.com/thewh1teagle/vibe/</span><span class="invisible"></span></a> Easier to use than what I was using before (WhisperDesktop). Default settings use the medium Whisper model, which has been good enough in my experience.<br><a href="https://mastodon.social/tags/Accessibility" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Accessibility</span></a> <a href="https://mastodon.social/tags/A11y" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>A11y</span></a> <a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://mastodon.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://mastodon.social/tags/EdTech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>EdTech</span></a></p>
The Conversation U.S.<p>Speech recognition systems struggle with accents and dialects, risking problems in critical fields like healthcare and emergency services. Imagine calling 911 and the AI used to screen out non-emergency calls can’t understand you. </p><p>A Spanish language professor explains: <a href="https://theconversation.com/sorry-i-didnt-get-that-ai-misunderstands-some-peoples-words-more-than-others-239281" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">theconversation.com/sorry-i-di</span><span class="invisible">dnt-get-that-ai-misunderstands-some-peoples-words-more-than-others-239281</span></a> <a href="https://newsie.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://newsie.social/tags/speechrecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechrecognition</span></a></p>
Kuketz-Blog 🛡<p><a href="https://social.tchncs.de/tags/UnplugTrump" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>UnplugTrump</span></a> - Tipp5: <br>Verabschiede dich von Alexa und anderen Sprachassistenten, die deine Gespräche mithören und auswerten. Nutze stattdessen eine datenschutzfreundliche Alternative wie OpenVoiceOS, ein Open-Source-Sprachassistent, der von einer aktiven Community weiterentwickelt wird und auf einem RaspberryPi läuft. So behältst du die Kontrolle über deine Daten.</p><p><a href="https://social.tchncs.de/tags/Alexa" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Alexa</span></a> <a href="https://social.tchncs.de/tags/OpenVoiceOS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenVoiceOS</span></a> <a href="https://social.tchncs.de/tags/Sprachassistent" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Sprachassistent</span></a> <a href="https://social.tchncs.de/tags/VoiceControl" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceControl</span></a> <a href="https://social.tchncs.de/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://social.tchncs.de/tags/datenschutz" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>datenschutz</span></a> <a href="https://social.tchncs.de/tags/privacy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>privacy</span></a></p>
Mac<p>Browsing with <a href="https://mastodon.social/tags/speechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechRecognition</span></a></p><p><a href="https://www.youtube.com/watch?v=iEiSez9F79o" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">youtube.com/watch?v=iEiSez9F79</span><span class="invisible">o</span></a></p><p><a href="https://mastodon.social/tags/accessibility" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>accessibility</span></a> <a href="https://mastodon.social/tags/browser" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>browser</span></a> <a href="https://mastodon.social/tags/web" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>web</span></a></p>
SleepyCatten<p>Hey folks :FediverseSymbol: </p><p>We've actually done an unwritten, off-the-cusp trans voice Friday recording today :TransHeart: </p><p>We've not listened back to it, because voice dysphoria, but we've added full alt text.</p><p>In case you're wondering how we've done that without listening back to it, we've once against used an amazing tool called <a href="https://www.nikse.dk/subtitleedit" rel="nofollow noopener" target="_blank">Subtitle Edit</a>, which has audio to text functionality via the <a href="https://github.com/Purfview/whisper-standalone-win" rel="nofollow noopener" target="_blank">Whisper</a> speech recognition engine.</p><p>We used the large-v3 model, which is about 3.1 GB, but gives incredibly accurate transcription.</p><p>In case anyone can't access the alt text, we've added the full transcript below too.</p><p><a href="https://cultofshiv.wtf/tags/TransVoiceFriday" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TransVoiceFriday</span></a> <a href="https://cultofshiv.wtf/tags/TransVoice" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TransVoice</span></a> <a href="https://cultofshiv.wtf/tags/voice" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>voice</span></a> <a href="https://cultofshiv.wtf/tags/VoiceFeminisation" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceFeminisation</span></a> <a href="https://cultofshiv.wtf/tags/VoiceFeminization" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceFeminization</span></a> <a href="https://cultofshiv.wtf/tags/VoiceTraining" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceTraining</span></a> <a href="https://cultofshiv.wtf/tags/trans" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>trans</span></a> <a href="https://cultofshiv.wtf/tags/transgender" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>transgender</span></a> <a href="https://cultofshiv.wtf/tags/TransFem" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TransFem</span></a> <a href="https://cultofshiv.wtf/tags/VoiceDysphoria" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceDysphoria</span></a> <a href="https://cultofshiv.wtf/tags/SubtitleEdit" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SubtitleEdit</span></a> <a href="https://cultofshiv.wtf/tags/PurfviewWhisper" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>PurfviewWhisper</span></a> <a href="https://cultofshiv.wtf/tags/AudioToText" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AudioToText</span></a> <a href="https://cultofshiv.wtf/tags/SpeechToText" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechToText</span></a> <a href="https://cultofshiv.wtf/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a> </p><blockquote><p>Hey folks, I know that we haven't done a voice note in forever, and that's been for a multitude of reasons, some of which are related to mental health, some of which are related to work, stress, anxiety, depression, etc, things like that, which comes under mental health anyway, yeah, partly due to poor time management, yay for being AuDHD! But not gonna lie, some of it does come down to underlying voice dysphoria, because this is the best we've managed to get since December 2021. And just for anyone who hasn't heard roughly what we sounded like beforehand, we haven't exactly moved our voice up a lot. I mean, the base level would just be down here. So I can move my voice back up here easily now, and this is the comfortable, this is the default voice. But, um... It's not where I want it to be, it's not in the female range, and I can't easily push the pitch up higher without it sounding wrong. But yeah, there's been a lot of stuff going on recently, um, a lot of bad stuff for everyone, don't want to talk about all of that. But, um, let's just focus on supporting each other, helping each other, um, being kind to ourselves and others right now, and being compassionate and empathetic. That's all I've really got to say. I'm trying to do the same thing with ourselves, but yeah, it's hard sometimes. Anyway, ta-ta for now.</p></blockquote>
Kathy Reid<p>Absolutely the fuck not. </p><p><a href="https://aus.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://aus.social/tags/Advertising" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Advertising</span></a> <a href="https://aus.social/tags/Surveillance" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Surveillance</span></a></p><p><a href="https://therecord.media/ford-patent-application-in-vehicle-listening-advertising" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">therecord.media/ford-patent-ap</span><span class="invisible">plication-in-vehicle-listening-advertising</span></a></p>
Thorsten Rochelmeyer<p>Gibt es aktuell eine gut funktionierende und anwendungsfreundliche Möglichkeit, Text direkt in ein <a href="https://climatejustice.social/tags/Libreoffice" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Libreoffice</span></a>-Dokument zu <a href="https://climatejustice.social/tags/diktieren" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>diktieren</span></a>? <br>Lokale Lösungen! Keinesfalls Cloud. </p><p>Ich kenne den Weg, eine Audio-Datei mit <a href="https://climatejustice.social/tags/Whisper" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Whisper</span></a> zu transkribieren. Das ist super, nutzt mir aber aktuell gerade recht wenig. Ich bräuchte was, wo der Prozess schon während des Sprechens läuft ...so wie bei Dragon und ähnlichen "Diktiersystemen", die es mal gab (und von denen ich nicht weiß, ob sie noch existieren). </p><p><a href="https://climatejustice.social/tags/Linux" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Linux</span></a> <a href="https://climatejustice.social/tags/STT" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>STT</span></a> <a href="https://climatejustice.social/tags/SpeechToText" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechToText</span></a> <a href="https://climatejustice.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a></p>