eupolicy.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
This Mastodon server is a friendly and respectful discussion space for people working in areas related to EU policy. When you request to create an account, please tell us something about you.

Server stats:

224
active users

#speechrecognition

0 posts0 participants0 posts today
Debby<p>🌟 Excited to share Thorsten-Voice's YouTube channel! 🎥 🗣️🔊 ♿ 💬</p><p>Thorsten presents innovative TTS solutions and a variety of voice technologies, making it an excellent starting point for anyone interested in open-source text-to-speech. Whether you're a developer, accessibility advocate, or tech enthusiast, his channel offers valuable insights and resources. Don't miss out on this fantastic content! 🎬</p><p>follow hem here: <span class="h-card" translate="no"><a href="https://techhub.social/@thorstenvoice" class="u-url mention" rel="nofollow noopener noreferrer" target="_blank">@<span>thorstenvoice</span></a></span> <br>or on YouTube: <a href="https://www.youtube.com/@ThorstenMueller" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="">youtube.com/@ThorstenMueller</span><span class="invisible"></span></a> YouTube channel! </p><p><a href="https://hear-me.social/tags/Accessibility" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Accessibility</span></a> <a href="https://hear-me.social/tags/FLOSS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>FLOSS</span></a> <a href="https://hear-me.social/tags/TTS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>TTS</span></a> <a href="https://hear-me.social/tags/ParlerTTS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ParlerTTS</span></a> <a href="https://hear-me.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>OpenSource</span></a> <a href="https://hear-me.social/tags/VoiceTech" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>VoiceTech</span></a> <a href="https://hear-me.social/tags/TextToSpeech" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>TextToSpeech</span></a> <a href="https://hear-me.social/tags/AI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AI</span></a> <a href="https://hear-me.social/tags/CoquiAI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>CoquiAI</span></a> <a href="https://hear-me.social/tags/VoiceAssistant" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>VoiceAssistant</span></a> <a href="https://hear-me.social/tags/Sprachassistent" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Sprachassistent</span></a> <a href="https://hear-me.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>MachineLearning</span></a> <a href="https://hear-me.social/tags/AccessibilityMatters" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AccessibilityMatters</span></a> <a href="https://hear-me.social/tags/FLOSS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>FLOSS</span></a> <a href="https://hear-me.social/tags/TTS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>TTS</span></a> <a href="https://hear-me.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>OpenSource</span></a> <a href="https://hear-me.social/tags/Inclusivity" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Inclusivity</span></a> <a href="https://hear-me.social/tags/FOSS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>FOSS</span></a> <a href="https://hear-me.social/tags/Coqui" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Coqui</span></a> <a href="https://hear-me.social/tags/AI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AI</span></a> <a href="https://hear-me.social/tags/CoquiAI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>CoquiAI</span></a> <a href="https://hear-me.social/tags/VoiceAssistant" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>VoiceAssistant</span></a> <a href="https://hear-me.social/tags/Sprachassistent" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Sprachassistent</span></a> <a href="https://hear-me.social/tags/VoiceTechnology" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>VoiceTechnology</span></a> <a href="https://hear-me.social/tags/K%C3%BCnstlicheStimme" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>KünstlicheStimme</span></a> <a href="https://hear-me.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>MachineLearning</span></a> <a href="https://hear-me.social/tags/Python" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Python</span></a> <a href="https://hear-me.social/tags/Rhasspy" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Rhasspy</span></a> <a href="https://hear-me.social/tags/TextToSpeech" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>TextToSpeech</span></a> <a href="https://hear-me.social/tags/VoiceTech" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>VoiceTech</span></a> <a href="https://hear-me.social/tags/STT" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>STT</span></a> <a href="https://hear-me.social/tags/SpeechSynthesis" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>SpeechSynthesis</span></a> <a href="https://hear-me.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://hear-me.social/tags/Sprachsynthese" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Sprachsynthese</span></a> <a href="https://hear-me.social/tags/ArtificialVoice" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ArtificialVoice</span></a> <a href="https://hear-me.social/tags/VoiceCloning" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>VoiceCloning</span></a> <a href="https://hear-me.social/tags/Spracherkennung" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Spracherkennung</span></a> <a href="https://hear-me.social/tags/CoquiTTS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>CoquiTTS</span></a> <a href="https://hear-me.social/tags/voice" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>voice</span></a> <a href="https://hear-me.social/tags/a11y" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>a11y</span></a> <a href="https://hear-me.social/tags/ScreenReader" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ScreenReader</span></a></p>
Debby<p>Goode <span class="h-card" translate="no"><a href="https://techhub.social/@thorstenvoice" class="u-url mention" rel="nofollow noopener noreferrer" target="_blank">@<span>thorstenvoice</span></a></span>, just found your channel and I'm impressed! Your work on TTS is fantastic and so important for accessibility in the FLOSS community. Keep it up! <a href="https://hear-me.social/tags/AccessibilityMatters" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AccessibilityMatters</span></a> <a href="https://hear-me.social/tags/FLOSS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>FLOSS</span></a> <a href="https://hear-me.social/tags/TTS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>TTS</span></a> <a href="https://hear-me.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>OpenSource</span></a> <a href="https://hear-me.social/tags/Inclusivity" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Inclusivity</span></a> <a href="https://hear-me.social/tags/FOSS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>FOSS</span></a> <a href="https://hear-me.social/tags/Coqui" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Coqui</span></a> <a href="https://hear-me.social/tags/AI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AI</span></a> <a href="https://hear-me.social/tags/CoquiAI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>CoquiAI</span></a> <a href="https://hear-me.social/tags/VoiceAssistant" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>VoiceAssistant</span></a> <a href="https://hear-me.social/tags/Sprachassistent" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Sprachassistent</span></a> <a href="https://hear-me.social/tags/VoiceTechnology" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>VoiceTechnology</span></a> <a href="https://hear-me.social/tags/K%C3%BCnstlicheStimme" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>KünstlicheStimme</span></a> <a href="https://hear-me.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>MachineLearning</span></a> <a href="https://hear-me.social/tags/Python" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Python</span></a> <a href="https://hear-me.social/tags/Rhasspy" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Rhasspy</span></a> <a href="https://hear-me.social/tags/TextToSpeech" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>TextToSpeech</span></a> <a href="https://hear-me.social/tags/VoiceTech" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>VoiceTech</span></a> <a href="https://hear-me.social/tags/STT" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>STT</span></a> <a href="https://hear-me.social/tags/SpeechSynthesis" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>SpeechSynthesis</span></a> <a href="https://hear-me.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://hear-me.social/tags/Sprachsynthese" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Sprachsynthese</span></a> <a href="https://hear-me.social/tags/ArtificialVoice" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ArtificialVoice</span></a> <a href="https://hear-me.social/tags/VoiceCloning" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>VoiceCloning</span></a> <a href="https://hear-me.social/tags/Spracherkennung" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Spracherkennung</span></a> <a href="https://hear-me.social/tags/CoquiTTS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>CoquiTTS</span></a> <a href="https://hear-me.social/tags/voice" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>voice</span></a> <a href="https://hear-me.social/tags/a11y" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>a11y</span></a> <a href="https://hear-me.social/tags/ScreenReader" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ScreenReader</span></a></p>
IT News<p>Christmas Comes Early With AI Santa Demo - With only two hundred odd days ’til Christmas, you just know we’re already feeling... - <a href="https://hackaday.com/2025/05/18/christmas-comes-early-with-ai-santa-demo/" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">hackaday.com/2025/05/18/christ</span><span class="invisible">mas-comes-early-with-ai-santa-demo/</span></a> <a href="https://schleuss.online/tags/artificialintelligence" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>artificialintelligence</span></a> <a href="https://schleuss.online/tags/speechrecognition" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>speechrecognition</span></a> <a href="https://schleuss.online/tags/speechsynthesis" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>speechsynthesis</span></a> <a href="https://schleuss.online/tags/santaclaus" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>santaclaus</span></a> <a href="https://schleuss.online/tags/libpeer" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>libpeer</span></a> <a href="https://schleuss.online/tags/openai" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>openai</span></a> <a href="https://schleuss.online/tags/llm" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>llm</span></a> <a href="https://schleuss.online/tags/ai" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ai</span></a></p>
Richard Emling (DO9RE)<p>I'm exploring ways to improve audio preprocessing for speech recognition for my [midi2hamlib](<a href="https://github.com/DO9RE/midi2hamlib" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="">github.com/DO9RE/midi2hamlib</span><span class="invisible"></span></a>) project. Do any of my followers have expertise with **SoX** or **speech recognition**? Specifically, I’m seeking advice on: 1️⃣ Best practices for audio preparation for speech recognition. 2️⃣ SoX command-line parameters that can optimize audio during recording or playback. <br> <a href="https://github.com/DO9RE/midi2hamlib/blob/main/tests/speech_menu.sh" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">github.com/DO9RE/midi2hamlib/b</span><span class="invisible">lob/main/tests/speech_menu.sh</span></a> <a href="https://metalhead.club/tags/SoX" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>SoX</span></a> <a href="https://metalhead.club/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://metalhead.club/tags/OpenSource" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>OpenSource</span></a> <a href="https://metalhead.club/tags/AudioProcessing" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AudioProcessing</span></a> <a href="https://metalhead.club/tags/ShellScripting" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ShellScripting</span></a> <a href="https://metalhead.club/tags/Sphinx" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Sphinx</span></a> <a href="https://metalhead.club/tags/PocketSphinx" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>PocketSphinx</span></a> <a href="https://metalhead.club/tags/Audio" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Audio</span></a> Retoot appreciated.</p>
Pyrzout :vm:<p>Be Careful What You Ask For: Voice Control <a href="https://hackaday.com/2025/02/19/be-careful-what-you-ask-for-voice-control/" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">hackaday.com/2025/02/19/be-car</span><span class="invisible">eful-what-you-ask-for-voice-control/</span></a> <a href="https://social.skynetcloud.site/tags/speechrecognition" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>speechrecognition</span></a> <a href="https://social.skynetcloud.site/tags/computerspeech" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>computerspeech</span></a> <a href="https://social.skynetcloud.site/tags/voicecommand" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>voicecommand</span></a> <a href="https://social.skynetcloud.site/tags/Featured" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Featured</span></a> <a href="https://social.skynetcloud.site/tags/Rants" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Rants</span></a> <a href="https://social.skynetcloud.site/tags/rants" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>rants</span></a></p>
Doug Holton<p>Vibe is an <a href="https://mastodon.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>OpenSource</span></a> desktop client (mac, windows, linux) for locally running Whisper to more accurately transcribe or caption videos &amp; audio <a href="https://thewh1teagle.github.io/vibe/" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="">thewh1teagle.github.io/vibe/</span><span class="invisible"></span></a> Source code: <a href="https://github.com/thewh1teagle/vibe/" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="">github.com/thewh1teagle/vibe/</span><span class="invisible"></span></a> Easier to use than what I was using before (WhisperDesktop). Default settings use the medium Whisper model, which has been good enough in my experience.<br><a href="https://mastodon.social/tags/Accessibility" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Accessibility</span></a> <a href="https://mastodon.social/tags/A11y" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>A11y</span></a> <a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AI</span></a> <a href="https://mastodon.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://mastodon.social/tags/EdTech" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>EdTech</span></a></p>
The Conversation U.S.<p>Speech recognition systems struggle with accents and dialects, risking problems in critical fields like healthcare and emergency services. Imagine calling 911 and the AI used to screen out non-emergency calls can’t understand you. </p><p>A Spanish language professor explains: <a href="https://theconversation.com/sorry-i-didnt-get-that-ai-misunderstands-some-peoples-words-more-than-others-239281" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">theconversation.com/sorry-i-di</span><span class="invisible">dnt-get-that-ai-misunderstands-some-peoples-words-more-than-others-239281</span></a> <a href="https://newsie.social/tags/AI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AI</span></a> <a href="https://newsie.social/tags/speechrecognition" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>speechrecognition</span></a></p>
Mike Kuketz 🛡<p><a href="https://social.tchncs.de/tags/UnplugTrump" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>UnplugTrump</span></a> - Tipp5: <br>Verabschiede dich von Alexa und anderen Sprachassistenten, die deine Gespräche mithören und auswerten. Nutze stattdessen eine datenschutzfreundliche Alternative wie OpenVoiceOS, ein Open-Source-Sprachassistent, der von einer aktiven Community weiterentwickelt wird und auf einem RaspberryPi läuft. So behältst du die Kontrolle über deine Daten.</p><p><a href="https://social.tchncs.de/tags/Alexa" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Alexa</span></a> <a href="https://social.tchncs.de/tags/OpenVoiceOS" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>OpenVoiceOS</span></a> <a href="https://social.tchncs.de/tags/Sprachassistent" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Sprachassistent</span></a> <a href="https://social.tchncs.de/tags/VoiceControl" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>VoiceControl</span></a> <a href="https://social.tchncs.de/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://social.tchncs.de/tags/datenschutz" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>datenschutz</span></a> <a href="https://social.tchncs.de/tags/privacy" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>privacy</span></a></p>
beetle_b<p>Using LLMs to clean up the output of speech recognition has been a game changer for me in the past year:</p><p><a href="https://blog.nawaz.org/posts/2023/Dec/cleaning-up-speech-recognition-with-gpt/" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">blog.nawaz.org/posts/2023/Dec/</span><span class="invisible">cleaning-up-speech-recognition-with-gpt/</span></a></p><p>Note: I've improved my workflow compared to that post. I should write a followup.</p><p><a href="https://mastodon.xyz/tags/gpt" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>gpt</span></a> <a href="https://mastodon.xyz/tags/chatgpt" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>chatgpt</span></a> <a href="https://mastodon.xyz/tags/llm" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>llm</span></a> <a href="https://mastodon.xyz/tags/speechrecognition" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>speechrecognition</span></a></p>
Mac<p>Browsing with <a href="https://mastodon.social/tags/speechRecognition" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>speechRecognition</span></a></p><p><a href="https://www.youtube.com/watch?v=iEiSez9F79o" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">youtube.com/watch?v=iEiSez9F79</span><span class="invisible">o</span></a></p><p><a href="https://mastodon.social/tags/accessibility" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>accessibility</span></a> <a href="https://mastodon.social/tags/browser" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>browser</span></a> <a href="https://mastodon.social/tags/web" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>web</span></a></p>
SleepyCatten<p>Hey folks :FediverseSymbol: </p><p>We've actually done an unwritten, off-the-cusp trans voice Friday recording today :TransHeart: </p><p>We've not listened back to it, because voice dysphoria, but we've added full alt text.</p><p>In case you're wondering how we've done that without listening back to it, we've once against used an amazing tool called <a href="https://www.nikse.dk/subtitleedit" rel="nofollow noopener noreferrer" target="_blank">Subtitle Edit</a>, which has audio to text functionality via the <a href="https://github.com/Purfview/whisper-standalone-win" rel="nofollow noopener noreferrer" target="_blank">Whisper</a> speech recognition engine.</p><p>We used the large-v3 model, which is about 3.1 GB, but gives incredibly accurate transcription.</p><p>In case anyone can't access the alt text, we've added the full transcript below too.</p><p><a href="https://cultofshiv.wtf/tags/TransVoiceFriday" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>TransVoiceFriday</span></a> <a href="https://cultofshiv.wtf/tags/TransVoice" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>TransVoice</span></a> <a href="https://cultofshiv.wtf/tags/voice" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>voice</span></a> <a href="https://cultofshiv.wtf/tags/VoiceFeminisation" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>VoiceFeminisation</span></a> <a href="https://cultofshiv.wtf/tags/VoiceFeminization" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>VoiceFeminization</span></a> <a href="https://cultofshiv.wtf/tags/VoiceTraining" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>VoiceTraining</span></a> <a href="https://cultofshiv.wtf/tags/trans" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>trans</span></a> <a href="https://cultofshiv.wtf/tags/transgender" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>transgender</span></a> <a href="https://cultofshiv.wtf/tags/TransFem" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>TransFem</span></a> <a href="https://cultofshiv.wtf/tags/VoiceDysphoria" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>VoiceDysphoria</span></a> <a href="https://cultofshiv.wtf/tags/SubtitleEdit" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>SubtitleEdit</span></a> <a href="https://cultofshiv.wtf/tags/PurfviewWhisper" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>PurfviewWhisper</span></a> <a href="https://cultofshiv.wtf/tags/AudioToText" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AudioToText</span></a> <a href="https://cultofshiv.wtf/tags/SpeechToText" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>SpeechToText</span></a> <a href="https://cultofshiv.wtf/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>SpeechRecognition</span></a> </p><blockquote><p>Hey folks, I know that we haven't done a voice note in forever, and that's been for a multitude of reasons, some of which are related to mental health, some of which are related to work, stress, anxiety, depression, etc, things like that, which comes under mental health anyway, yeah, partly due to poor time management, yay for being AuDHD! But not gonna lie, some of it does come down to underlying voice dysphoria, because this is the best we've managed to get since December 2021. And just for anyone who hasn't heard roughly what we sounded like beforehand, we haven't exactly moved our voice up a lot. I mean, the base level would just be down here. So I can move my voice back up here easily now, and this is the comfortable, this is the default voice. But, um... It's not where I want it to be, it's not in the female range, and I can't easily push the pitch up higher without it sounding wrong. But yeah, there's been a lot of stuff going on recently, um, a lot of bad stuff for everyone, don't want to talk about all of that. But, um, let's just focus on supporting each other, helping each other, um, being kind to ourselves and others right now, and being compassionate and empathetic. That's all I've really got to say. I'm trying to do the same thing with ourselves, but yeah, it's hard sometimes. Anyway, ta-ta for now.</p></blockquote>
Kathy Reid<p>Absolutely the fuck not. </p><p><a href="https://aus.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://aus.social/tags/Advertising" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Advertising</span></a> <a href="https://aus.social/tags/Surveillance" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Surveillance</span></a></p><p><a href="https://therecord.media/ford-patent-application-in-vehicle-listening-advertising" rel="nofollow noopener noreferrer" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">therecord.media/ford-patent-ap</span><span class="invisible">plication-in-vehicle-listening-advertising</span></a></p>
Thorsten Rochelmeyer<p>Gibt es aktuell eine gut funktionierende und anwendungsfreundliche Möglichkeit, Text direkt in ein <a href="https://climatejustice.social/tags/Libreoffice" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Libreoffice</span></a>-Dokument zu <a href="https://climatejustice.social/tags/diktieren" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>diktieren</span></a>? <br>Lokale Lösungen! Keinesfalls Cloud. </p><p>Ich kenne den Weg, eine Audio-Datei mit <a href="https://climatejustice.social/tags/Whisper" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Whisper</span></a> zu transkribieren. Das ist super, nutzt mir aber aktuell gerade recht wenig. Ich bräuchte was, wo der Prozess schon während des Sprechens läuft ...so wie bei Dragon und ähnlichen "Diktiersystemen", die es mal gab (und von denen ich nicht weiß, ob sie noch existieren). </p><p><a href="https://climatejustice.social/tags/Linux" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Linux</span></a> <a href="https://climatejustice.social/tags/STT" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>STT</span></a> <a href="https://climatejustice.social/tags/SpeechToText" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>SpeechToText</span></a> <a href="https://climatejustice.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>SpeechRecognition</span></a></p>
Jürgen Hubert<p>After recently voicing my frustration with how much effort it is for me to type in <a href="https://thefolklore.cafe/tags/AltText" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AltText</span></a> while mobile and away from my desktop PC, I am wondering whether using <a href="https://thefolklore.cafe/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>SpeechRecognition</span></a> with a <a href="https://thefolklore.cafe/tags/Bluetooth" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Bluetooth</span></a> headset might be a viable solution.</p><p>But I don't have much experience with either. Thus:</p><p>What kind of _lightweight_ Bluetooth headset could you recommend which would easily fit into the pocket of my jacket and the like (I like to travel lightly when doing bicycle trips)?</p><p>And what kind of speech-to-text app would work well when entering Alt-Text into my Mastodon app (currently using Tusky, but I am willing to change if that's what it takes)? Ideally it should support both English and German.</p>
Feilner IT<p><a href="https://mastodon.social/tags/ArtificialIntelligence" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>ArtificialIntelligence</span></a>: <a href="https://mastodon.social/tags/feilnerism" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>feilnerism</span></a> </p><p>We are using a term that we cannot define ("intelligence") for marketing by adding an adjective to make it sound more sophisticated and technical with the only intention of selling, making money out of it. Today, everything that is "magic" (see Clarke) has become "<a href="https://mastodon.social/tags/AI" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>AI</span></a>": <a href="https://mastodon.social/tags/LLM" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>LLM</span></a>, <a href="https://mastodon.social/tags/PatternRecognition" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>PatternRecognition</span></a>, <a href="https://mastodon.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>MachineLearning</span></a>, <a href="https://mastodon.social/tags/Cloud" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Cloud</span></a>, <a href="https://mastodon.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>SpeechRecognition</span></a>, <a href="https://mastodon.social/tags/Assistants" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Assistants</span></a>, ... all the stuff from the last decades. What is really new about it?</p>
Will M<p>Reposting my <a href="https://mastodon.sdf.org/tags/introduction" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>introduction</span></a> after the SDF database crash—</p><p>Hi everybody! I'm Will and I enjoy all things <a href="https://mastodon.sdf.org/tags/languages" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>languages</span></a> and <a href="https://mastodon.sdf.org/tags/code" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>code</span></a>. My day job is in <a href="https://mastodon.sdf.org/tags/nlp" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>nlp</span></a> (natural language processing <a href="https://mastodon.sdf.org/tags/nlproc" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>nlproc</span></a>) and <a href="https://mastodon.sdf.org/tags/speechrecognition" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>speechrecognition</span></a> for language education. In grad school I worked on <a href="https://mastodon.sdf.org/tags/Bayesian" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Bayesian</span></a> <a href="https://mastodon.sdf.org/tags/pragmatics" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>pragmatics</span></a> with <a href="https://mastodon.sdf.org/tags/deeplearning" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>deeplearning</span></a>.</p><p>I speak English natively, <a href="https://mastodon.sdf.org/tags/espa%C3%B1ol" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>español</span></a> <a href="https://mastodon.sdf.org/tags/Spanish" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Spanish</span></a> / <a href="https://mastodon.sdf.org/tags/%E4%B8%AD%E6%96%87" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>中文</span></a> <a href="https://mastodon.sdf.org/tags/Chinese" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Chinese</span></a> / <a href="https://mastodon.sdf.org/tags/%D8%A7%D9%84%D8%B9%D8%B1%D8%A8%D9%8A%D8%A9" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>العربية</span></a> <a href="https://mastodon.sdf.org/tags/Arabic" class="mention hashtag" rel="nofollow noopener noreferrer" target="_blank">#<span>Arabic</span></a> passably, and lots of others poorly. Talk to me in your language!</p>