eupolicy.social is one of the many independent Mastodon servers you can use to participate in the fediverse.
This Mastodon server is a friendly and respectful discussion space for people working in areas related to EU policy. When you request to create an account, please tell us something about you.

Server stats:

201
active users

#speechsynthesis

0 posts0 participants0 posts today
Georgiana Brummell<p>Since my previous post asking about programmers, etc. received so many positive responses, I am going to use the same tags and explain what I wish to accomplish. That way, I can learn from real experts what is possible and what isn't. Note that I am not a programmer and am just writing as a user.</p><p>Hello, everyone. I am forty-one and totally blind, having never seen. I have loved DOS since I was a teenager and basically taught myself tto use it, since by the time I learned about it, people were already moving to Windows. I love XP and 7 but find 11 to be frustrating and annoying. Unlike many, I don't find Linux or Mac OS to be worthy replacements. But I strongly feel, given the general advances in technology, as well as those in modern versions of DOS, that it can be a viable alternative. It's quick, efficient, and text-based. This, then, is my ultimate vision. Some of these things may be easier to implement than others, and some may not even be possible. One of my favourite sites is this one, which debunks all sorts of fallacies related to DOS and gives me hope that my dream may someday be realised.</p><p><a href="http://www.chebucto.ns.ca/~ak621/DOS/DOS-Fal.html" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">http://www.</span><span class="ellipsis">chebucto.ns.ca/~ak621/DOS/DOS-</span><span class="invisible">Fal.html</span></a></p><p>Summary</p><p>My ultimate vision is a 32-bit version of DOS with true multi-tasking, a talking installer, an updated screen reader, a software synthesizer, and usb support that could be used as a daily operating system on modern (or at least semi-modern) hardware.</p><p>Blind-Specific Goals</p><p>1. Talking installer: One of the main difficulties of installing DOS for a blind person is the lack of speech without a dedicated screen reader. This was true even in Windows XP, and to a lesser degree, 7, though Talking Windows PE (a version with the NVDA screen reader slipstreamed into it) changed that. I have also seen someone load config.sys, autoexec.bat, and command.com along with the ASAP screen reader onto a floppy and boot from it, so it may, indeed, be possible, though booting from a floppy is automatic, whereas booting from anything else would require changing the bootloader, which is not accessible to the blind. If it is not possible to create a talking installer, perhaps some sort of batch system, similar to XP Unattended, can be created, so that the user just has to hit a few keys and start an automatic installation.</p><p>2. Software speech synthesis or reasonable alternative: This might be one of the most difficult things to implement, but it is th emost important. As it stands, most DOS screen readers work with hardware synthesizers that connect either via a serial port or an internal card. They work well, but unless new ones are made, they may be difficult to find. Plus, many computers don't have a serial port, and I'm not sure usb to serial can even work in DOS, especially for this sort of thing. Ideally, there would be a synthesizer, similar to ESpeak in NVDA, that would work directly with the screen reader to voice text on the screen. However, it seems that these sorts of synthesizers require apis, etc. that DOS doesn't have. Whether it would be possible to simulate a hardware synthesizer in real DOS as is done in the Talking DOSBox, which also contains Windows 95, I don't know. It is possible to send speech directly to the pc speaker, but most pc speakers, when they exist, are designed for beeps and very low quality output. That said, there was a novelty synthesizer, called Tran, that did just this. Perhaps a more serious version could be created and connected to a screen reader. There were screen readers that worked with the SoundBlaster synthesizer which did use software, but even that required the real card to be installed. If drivers and synthesizers can be created for more modern soundcards, that might be a bridge between full software synthesis and requiring an external device. A final option is simply to create modern synthesizers with an RS-232 connection. At least the speech would be good and they would still be manufactured, unlike the older ones.</p><p>3. Updated screen reader support: I don't know how much screen readers would need to be updated in order to be able to take advantage of modern programs and versions of DOS, but having that option would be a good thing. The only fully open source screen reader I know of is Provox. While JAWS for DOS, Vocal-Eyes, Flipper, etc. were all made freely available, we don't have their code. I am going to attempt to contact Larry Skutchan, maker of ASAP, to ask if he is willing to let us work with the code, or rewrite and update it, as he may no longer have the program.</p><p>General Goals</p><p>1. 32-bit: Even in Windows, I don't see the need for a 64-bit system. But I do think that DOS can benefit dramatically from being upgraded to 32-bit. It would mean more memory could be used in ram, true multi-tasking without extra tools could be done, and maybe, some of the blind-specific ideas of mine could be accomplished. I really cannot stress the importance of multitasking enough, even for mainstream things such as browsing the Internet while keeping an e-mail client open to alert for notifications, or even listening to music while reading a website or downloading something. I am fully aware of tsr programs, and they are wonderful, but they don't allow for background processes. I have heard of FreeDOs-32, but it seems to be no longer maintained.</p><p>2. Full usb support: I know that there is very rudamentary support for usb storage, but if this could be expanded to other devices, it might be possible to use a sound card for speech, a usb keyboard, a camera or scanner for ocr, a wifi dongle, etc.</p><p>3. An accessible, modern browser and wifi support: I know that it is possible to connect to the Internet using certain wireless cards. I also know that there is at least one graphical browser called Arachne. But whether it is accessible or has been updated, and whether more exist, I don't know. And what about systems without these cards? Can they access the Internet using wifi or at least cable via an ethernet connection?</p><p>4. A text-based, menu-driven desktop: I love the commandline, but sometimes, it might be quicker and/or easier to use menus. The graphical desktops require use of the mouse. I want to retain the text-based nature of DOS. It seems that this may already exist, and that I need to research DOS Navigator, Volkov Commander, Midnight Commander, and Norton Commander.</p><p>Things to Avoid</p><p>Don't turn DOS into Linux or Windows. Keep program installation simple, don't start requiring permissions for things, don't make everything graphical with a terrible interface that keeps changing, ribbons, etc., and don't include artificial intelligence as mandatory.</p><p><a href="https://someplace.social/tags/accessibility" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>accessibility</span></a> <a href="https://someplace.social/tags/AdaptiveTechnology" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AdaptiveTechnology</span></a> <a href="https://someplace.social/tags/blind" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>blind</span></a> <a href="https://someplace.social/tags/DOS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DOS</span></a> <a href="https://someplace.social/tags/FreeDOS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FreeDOS</span></a> <a href="https://someplace.social/tags/Internet" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Internet</span></a> <a href="https://someplace.social/tags/MSDOS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MSDOS</span></a> <a href="https://someplace.social/tags/programmers" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>programmers</span></a> <a href="https://someplace.social/tags/programming" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>programming</span></a> <a href="https://someplace.social/tags/ScreenReaders" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ScreenReaders</span></a> <a href="https://someplace.social/tags/software" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>software</span></a> <a href="https://someplace.social/tags/SpeechSynthesis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechSynthesis</span></a> <a href="https://someplace.social/tags/technology" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>technology</span></a></p>
Georgiana Brummell<p>Would anyone be able to help me contact Larry Skutchan, or could someone please pass a question on to him? I don't wish to disturb him, especially now that he is retired. Several years ago, I recall asking him about ASAP and if I could obtain a full version of it. He told me that he no longer had it, but that the demonstration was fully functional, with only some reminders to buy the product. I must assume that, since he lost the program, he also lost the source code. But if I knew the language in which it was written, I could find a programmer to rewrite the code, updating it to add support for software synthesizers, various modern things, and even create new set files for updated and current programs. But naturally, I need his permission to do so, since it is not open source and I don't want to get in any sort of legal trouble. While Provox is, indeed, open source, ASAP is one of the most advanced and flexible DOS screen readers, and I would like to use it in my project to make FreeDOS and its programs more accessible.</p><p><a href="https://someplace.social/tags/accessibility" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>accessibility</span></a> <a href="https://someplace.social/tags/APH" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>APH</span></a> <a href="https://someplace.social/tags/ASAP" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ASAP</span></a> <a href="https://someplace.social/tags/blind" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>blind</span></a> <a href="https://someplace.social/tags/DOS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DOS</span></a> <a href="https://someplace.social/tags/FreeDOS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FreeDOS</span></a> <a href="https://someplace.social/tags/programming" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>programming</span></a> <a href="https://someplace.social/tags/ScreenReader" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ScreenReader</span></a> <a href="https://someplace.social/tags/ScreenReader" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ScreenReader</span></a> <a href="https://someplace.social/tags/SpeechSynthesis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechSynthesis</span></a> <a href="https://someplace.social/tags/technology" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>technology</span></a></p>
Georgiana Brummell<p>I cannot figure out how to get real DOS (of any kind) working in VMWare with Com0com and NVDA (my hardware synthesizers are packed away at the moment), so right now, I have Talking DOSBox. Since it already speaks and has various synthesizers available, I would like to know if it would be possible to substitute MS-DOS with FreeDOS 1.4, since I want to try the advanced features, modern programs, etc.</p><p><a href="https://freedos.org/download/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">freedos.org/download/</span><span class="invisible"></span></a></p><p>The main problem I see is installing it with speech. Perplexity gave me instructions that seemed viable, but upon actually looking in the various directories, I discovered that the reason Talking DOSBox works with the SoundBlaster synthesizer is that it's not pure MS-DOS but the version that comes with Windows for Work Groups. There is another way to access speech, so that NVDA acts as a bns driver, but I'm not sure if this would work, either with MS-DOS or FreeDOS. Can anyone assist me?</p><p><a href="https://someplace.social/tags/accessibility" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>accessibility</span></a> <a href="https://someplace.social/tags/blind" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>blind</span></a> <a href="https://someplace.social/tags/computing" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>computing</span></a> <a href="https://someplace.social/tags/DOS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DOS</span></a> <a href="https://someplace.social/tags/DOSBox" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>DOSBox</span></a> <a href="https://someplace.social/tags/FreeDOS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FreeDOS</span></a> <a href="https://someplace.social/tags/NVDA" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>NVDA</span></a> <a href="https://someplace.social/tags/OperatingSystems" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OperatingSystems</span></a> <a href="https://someplace.social/tags/ScreenReaders" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ScreenReaders</span></a> <a href="https://someplace.social/tags/SpeechSynthesis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechSynthesis</span></a> <a href="https://someplace.social/tags/technology" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>technology</span></a> <a href="https://someplace.social/tags/VirtualMachines" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VirtualMachines</span></a> <a href="https://someplace.social/tags/VMWare" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VMWare</span></a> <a href="https://someplace.social/tags/Windows" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Windows</span></a></p>
IT News<p>Convert Any Book to a DIY Audiobook? - If the idea of reading a physical book sounds like hard work, [Nick Bild’s] latest... - <a href="https://hackaday.com/2025/07/06/convert-any-book-to-a-diy-audiobook/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">hackaday.com/2025/07/06/conver</span><span class="invisible">t-any-book-to-a-diy-audiobook/</span></a> <a href="https://schleuss.online/tags/artificialintelligence" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>artificialintelligence</span></a> <a href="https://schleuss.online/tags/raspberrypizero2w" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>raspberrypizero2w</span></a> <a href="https://schleuss.online/tags/googlegemini2" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>googlegemini2</span></a>.5 <a href="https://schleuss.online/tags/speechsynthesis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechsynthesis</span></a> <a href="https://schleuss.online/tags/raspberrypi" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>raspberrypi</span></a> <a href="https://schleuss.online/tags/pipervoice" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>pipervoice</span></a> <a href="https://schleuss.online/tags/webcam" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>webcam</span></a> <a href="https://schleuss.online/tags/genai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>genai</span></a> <a href="https://schleuss.online/tags/cv2" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>cv2</span></a> <a href="https://schleuss.online/tags/ocr" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ocr</span></a> <a href="https://schleuss.online/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ai</span></a></p>
Debby ‬⁂📎🐧:disability_flag:<p><span class="h-card" translate="no"><a href="https://mastodon.social/@thelinuxEXP" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>thelinuxEXP</span></a></span> I really like Speech Note! It's a fantastic tool for quick and local voice transcription in multiple languages, created by <span class="h-card" translate="no"><a href="https://mastodon.social/@mkiol" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>mkiol</span></a></span> </p><p>It's incredibly handy for capturing thoughts on the go, conducting interviews, or making voice memos without worrying about language barriers. The app uses strictly locally running LLMs, and its ease of use makes it a standout choice for anyone needing offline transcription services.</p><p>I primarily use <a href="https://hear-me.social/tags/WhisperAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>WhisperAI</span></a> for transcription and Piper for voice, but many other models are available as well. </p><p>It is available as flatpak and <a href="https://github.com/mkiol/dsnote" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">github.com/mkiol/dsnote</span><span class="invisible"></span></a> </p><p><a href="https://hear-me.social/tags/TTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TTS</span></a> <a href="https://hear-me.social/tags/transcription" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>transcription</span></a> <a href="https://hear-me.social/tags/TextToSpeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TextToSpeech</span></a> <a href="https://hear-me.social/tags/translator" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>translator</span></a> translation <a href="https://hear-me.social/tags/offline" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>offline</span></a> <a href="https://hear-me.social/tags/machinetranslation" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>machinetranslation</span></a> <a href="https://hear-me.social/tags/sailfishos" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>sailfishos</span></a> <a href="https://hear-me.social/tags/SpeechSynthesis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechSynthesis</span></a> <a href="https://hear-me.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://hear-me.social/tags/speechtotext" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechtotext</span></a> <a href="https://hear-me.social/tags/nmt" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>nmt</span></a> <a href="https://hear-me.social/tags/linux" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>linux</span></a>-desktop <a href="https://hear-me.social/tags/stt" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>stt</span></a> <a href="https://hear-me.social/tags/asr" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>asr</span></a> <a href="https://hear-me.social/tags/flatpak" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>flatpak</span></a>-applications <a href="https://hear-me.social/tags/SpeechNote" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechNote</span></a></p>
Debby ‬⁂📎🐧:disability_flag:<p>🌟 Excited to share Thorsten-Voice's YouTube channel! 🎥 🗣️🔊 ♿ 💬</p><p>Thorsten presents innovative TTS solutions and a variety of voice technologies, making it an excellent starting point for anyone interested in open-source text-to-speech. Whether you're a developer, accessibility advocate, or tech enthusiast, his channel offers valuable insights and resources. Don't miss out on this fantastic content! 🎬</p><p>follow hem here: <span class="h-card" translate="no"><a href="https://techhub.social/@thorstenvoice" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>thorstenvoice</span></a></span> <br>or on YouTube: <a href="https://www.youtube.com/@ThorstenMueller" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="">youtube.com/@ThorstenMueller</span><span class="invisible"></span></a> YouTube channel! </p><p><a href="https://hear-me.social/tags/Accessibility" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Accessibility</span></a> <a href="https://hear-me.social/tags/FLOSS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FLOSS</span></a> <a href="https://hear-me.social/tags/TTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TTS</span></a> <a href="https://hear-me.social/tags/ParlerTTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ParlerTTS</span></a> <a href="https://hear-me.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSource</span></a> <a href="https://hear-me.social/tags/VoiceTech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceTech</span></a> <a href="https://hear-me.social/tags/TextToSpeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TextToSpeech</span></a> <a href="https://hear-me.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://hear-me.social/tags/CoquiAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>CoquiAI</span></a> <a href="https://hear-me.social/tags/VoiceAssistant" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceAssistant</span></a> <a href="https://hear-me.social/tags/Sprachassistent" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Sprachassistent</span></a> <a href="https://hear-me.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MachineLearning</span></a> <a href="https://hear-me.social/tags/AccessibilityMatters" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AccessibilityMatters</span></a> <a href="https://hear-me.social/tags/FLOSS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FLOSS</span></a> <a href="https://hear-me.social/tags/TTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TTS</span></a> <a href="https://hear-me.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSource</span></a> <a href="https://hear-me.social/tags/Inclusivity" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Inclusivity</span></a> <a href="https://hear-me.social/tags/FOSS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FOSS</span></a> <a href="https://hear-me.social/tags/Coqui" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Coqui</span></a> <a href="https://hear-me.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://hear-me.social/tags/CoquiAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>CoquiAI</span></a> <a href="https://hear-me.social/tags/VoiceAssistant" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceAssistant</span></a> <a href="https://hear-me.social/tags/Sprachassistent" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Sprachassistent</span></a> <a href="https://hear-me.social/tags/VoiceTechnology" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceTechnology</span></a> <a href="https://hear-me.social/tags/K%C3%BCnstlicheStimme" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>KünstlicheStimme</span></a> <a href="https://hear-me.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MachineLearning</span></a> <a href="https://hear-me.social/tags/Python" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Python</span></a> <a href="https://hear-me.social/tags/Rhasspy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Rhasspy</span></a> <a href="https://hear-me.social/tags/TextToSpeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TextToSpeech</span></a> <a href="https://hear-me.social/tags/VoiceTech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceTech</span></a> <a href="https://hear-me.social/tags/STT" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>STT</span></a> <a href="https://hear-me.social/tags/SpeechSynthesis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechSynthesis</span></a> <a href="https://hear-me.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://hear-me.social/tags/Sprachsynthese" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Sprachsynthese</span></a> <a href="https://hear-me.social/tags/ArtificialVoice" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ArtificialVoice</span></a> <a href="https://hear-me.social/tags/VoiceCloning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceCloning</span></a> <a href="https://hear-me.social/tags/Spracherkennung" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Spracherkennung</span></a> <a href="https://hear-me.social/tags/CoquiTTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>CoquiTTS</span></a> <a href="https://hear-me.social/tags/voice" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>voice</span></a> <a href="https://hear-me.social/tags/a11y" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>a11y</span></a> <a href="https://hear-me.social/tags/ScreenReader" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ScreenReader</span></a></p>
Debby ‬⁂📎🐧:disability_flag:<p>Goode <span class="h-card" translate="no"><a href="https://techhub.social/@thorstenvoice" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>thorstenvoice</span></a></span>, just found your channel and I'm impressed! Your work on TTS is fantastic and so important for accessibility in the FLOSS community. Keep it up! <a href="https://hear-me.social/tags/AccessibilityMatters" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AccessibilityMatters</span></a> <a href="https://hear-me.social/tags/FLOSS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FLOSS</span></a> <a href="https://hear-me.social/tags/TTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TTS</span></a> <a href="https://hear-me.social/tags/OpenSource" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>OpenSource</span></a> <a href="https://hear-me.social/tags/Inclusivity" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Inclusivity</span></a> <a href="https://hear-me.social/tags/FOSS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>FOSS</span></a> <a href="https://hear-me.social/tags/Coqui" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Coqui</span></a> <a href="https://hear-me.social/tags/AI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>AI</span></a> <a href="https://hear-me.social/tags/CoquiAI" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>CoquiAI</span></a> <a href="https://hear-me.social/tags/VoiceAssistant" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceAssistant</span></a> <a href="https://hear-me.social/tags/Sprachassistent" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Sprachassistent</span></a> <a href="https://hear-me.social/tags/VoiceTechnology" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceTechnology</span></a> <a href="https://hear-me.social/tags/K%C3%BCnstlicheStimme" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>KünstlicheStimme</span></a> <a href="https://hear-me.social/tags/MachineLearning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>MachineLearning</span></a> <a href="https://hear-me.social/tags/Python" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Python</span></a> <a href="https://hear-me.social/tags/Rhasspy" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Rhasspy</span></a> <a href="https://hear-me.social/tags/TextToSpeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TextToSpeech</span></a> <a href="https://hear-me.social/tags/VoiceTech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceTech</span></a> <a href="https://hear-me.social/tags/STT" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>STT</span></a> <a href="https://hear-me.social/tags/SpeechSynthesis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechSynthesis</span></a> <a href="https://hear-me.social/tags/SpeechRecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechRecognition</span></a> <a href="https://hear-me.social/tags/Sprachsynthese" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Sprachsynthese</span></a> <a href="https://hear-me.social/tags/ArtificialVoice" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ArtificialVoice</span></a> <a href="https://hear-me.social/tags/VoiceCloning" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>VoiceCloning</span></a> <a href="https://hear-me.social/tags/Spracherkennung" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>Spracherkennung</span></a> <a href="https://hear-me.social/tags/CoquiTTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>CoquiTTS</span></a> <a href="https://hear-me.social/tags/voice" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>voice</span></a> <a href="https://hear-me.social/tags/a11y" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>a11y</span></a> <a href="https://hear-me.social/tags/ScreenReader" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ScreenReader</span></a></p>
IT News<p>Christmas Comes Early With AI Santa Demo - With only two hundred odd days ’til Christmas, you just know we’re already feeling... - <a href="https://hackaday.com/2025/05/18/christmas-comes-early-with-ai-santa-demo/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">hackaday.com/2025/05/18/christ</span><span class="invisible">mas-comes-early-with-ai-santa-demo/</span></a> <a href="https://schleuss.online/tags/artificialintelligence" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>artificialintelligence</span></a> <a href="https://schleuss.online/tags/speechrecognition" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechrecognition</span></a> <a href="https://schleuss.online/tags/speechsynthesis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>speechsynthesis</span></a> <a href="https://schleuss.online/tags/santaclaus" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>santaclaus</span></a> <a href="https://schleuss.online/tags/libpeer" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>libpeer</span></a> <a href="https://schleuss.online/tags/openai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>openai</span></a> <a href="https://schleuss.online/tags/llm" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>llm</span></a> <a href="https://schleuss.online/tags/ai" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ai</span></a></p>
partizan<p>Сьогодні дивився на Open-Source Speech Synthesis, і все дуже цікаво.</p><p>Ну, спочатку, існують речі такі як `espeak-ng`, які можна встановити з репозиторію і вони наче як ... стандартні.</p><p>Але господи, яке воно страшне, найжахливіший синтезований голос шо я чув.</p><p>Далі я поліз гуглити, спочатку знайшов Mozilla TTS: <a href="https://github.com/mozilla/TTS/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">github.com/mozilla/TTS/</span><span class="invisible"></span></a> але воно схоже давно мертве. У Mozilla схоже є звичка шось починати і закидать.</p><p>Потім, знайшов <a href="https://github.com/coqui-ai/TTS" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">github.com/coqui-ai/TTS</span><span class="invisible"></span></a> ... В якому дуже цікаво виглядає те шо структура README дуже схожа з попереднім, команда інсталяції через pip така сама...</p><p>Вдалось його запустити, генерує непоганий голос, але така купа залежностей, тягте CUDA навіть коли воно мені не треба, але працює.</p><p>Далі цікавіше, Tortoise TTS:</p><p><a href="https://huggingface.co/spaces/Manmay/tortoise-tts" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">huggingface.co/spaces/Manmay/t</span><span class="invisible">ortoise-tts</span></a></p><p>Ось тут воно працює і непогано, але якшо спробувати запустити локально, то як мінімум на ноутбуці все настільки повільно шо я не дочекався поки згенерується одна фраза. Мабуть правду писали в README шо треба NVIDIA GPU.</p><p>Потім я знайшов ось цей реддіт тред, <a href="https://www.reddit.com/r/MachineLearning/comments/10yzq25/d_locallyrunnable_text_to_speech_ai/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://www.</span><span class="ellipsis">reddit.com/r/MachineLearning/c</span><span class="invisible">omments/10yzq25/d_locallyrunnable_text_to_speech_ai/</span></a></p><p>Пішов дивитись на Mimic, і десь там на форумі побачив шо вони out of business, зате подивіть на `piper-tts`.</p><p>І ось тут починаєтья найцікавіше: <a href="https://github.com/rhasspy/piper" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="">github.com/rhasspy/piper</span><span class="invisible"></span></a></p><p>&gt; A fast, local neural text to speech system</p><p>Є варіанти встановити як модуль python, є бінарник. Я спочатку думав шо якийсь з python, але ні. І воно генерує дуже непогану мову, дуже швидко, і без 10 гігабайт dependencies.</p><p>Дуже прикольна штука. Буду копати далі. Є навіть українські голоси, якість правда так собі, але є.</p><p><a href="https://rhasspy.github.io/piper-samples/" rel="nofollow noopener" translate="no" target="_blank"><span class="invisible">https://</span><span class="ellipsis">rhasspy.github.io/piper-sample</span><span class="invisible">s/</span></a></p><p>Єдина проблема, воно чомусь не сприймає newlines в тексті, доводиться робити отак:</p><p>```<br>echo $text | tr "\n\r" " " | ./piper -m ~/src/speak/en_US-lessac-medium.onnx -f - | paplay<br>```</p><p>Але то вже таке, шось придумаємо!</p><p><a href="https://twiukraine.com/tags/tts" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>tts</span></a> <a href="https://twiukraine.com/tags/SpeechSynthesis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechSynthesis</span></a> <a href="https://twiukraine.com/tags/PiperTTS" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>PiperTTS</span></a></p>
Head·word /ˈhedˌwɜː(ɹ)d/ n.<p><span class="h-card" translate="no"><a href="https://mastodon.social/@ZachWeinersmith" class="u-url mention" rel="nofollow noopener" target="_blank">@<span>ZachWeinersmith</span></a></span> </p><p>Immensely funny. I'll add some relevant hashtags here:</p><p><a href="https://lingo.lol/tags/linguistics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>linguistics</span></a> <a href="https://lingo.lol/tags/phonetics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>phonetics</span></a> <a href="https://lingo.lol/tags/prosody" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>prosody</span></a> <a href="https://lingo.lol/tags/pragmatics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>pragmatics</span></a> <a href="https://lingo.lol/tags/SpeechSounds" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechSounds</span></a> <a href="https://lingo.lol/tags/ArticulatoryPhonetics" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>ArticulatoryPhonetics</span></a> <a href="https://lingo.lol/tags/SpeechSynthesis" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>SpeechSynthesis</span></a> <a href="https://lingo.lol/tags/TextToSpeech" class="mention hashtag" rel="nofollow noopener" target="_blank">#<span>TextToSpeech</span></a></p>