A tool to generate voice files using various TTS providers
Re: A tool to generate voice files using various TTS providers
Cool, thank you
João
João
My er9x/Ersky9x/eepskye Video Tutorials
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
Release 2.8.1.1
- Scrollbar now displays correctly on list of phrases
Release 2.8.1.2
- Add support for opening phrase files encoded with code page 1250
- Update French translation
Re: A tool to generate voice files using various TTS providers
Could you please post (or let me know where I can find) the most recent PT translation file? I think there are a few things that might need changing
Thanks
João
Thanks
João
My er9x/Ersky9x/eepskye Video Tutorials
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
Re: A tool to generate voice files using various TTS providers
Hi João,
Here's the PT translation file.
Here's the PT translation file.
Re: A tool to generate voice files using various TTS providers
Thanks, will try to get it back to you later..
João
João
My er9x/Ersky9x/eepskye Video Tutorials
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
Re: A tool to generate voice files using various TTS providers
Here is the translation file back to you. Not easy
João
João
- Attachments
-
- TranslationsAtWork_pt-pt.csv
- (3.17 KiB) Downloaded 423 times
My er9x/Ersky9x/eepskye Video Tutorials
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
Re: A tool to generate voice files using various TTS providers
Thanks João, I have updated the 2.8.1.2 release with your translations
Re: A tool to generate voice files using various TTS providers
Thanks.. I have changed a couple lines as the text was not showing completely in some cases.
João
João
My er9x/Ersky9x/eepskye Video Tutorials
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
- KAL
- Posts: 552
- Joined: Thu Oct 10, 2013 4:36 pm
- Country: Germany
- Location: somewhere near Rothenburg ob der Tauber
Re: A tool to generate voice files using various TTS providers
Hi folks,
first of all a big thumpup and thank to Caffeine for creating this tool.
I simply have a problem with numbers (maybe there's a simple solution I just fail to see):
Every generated sound file starts with about 0.1 sec of silence. For a single file that's no problem
but composite numbers like e.g. 1234 sound a bit chopped/scattered ...
Does anyone have a simple solution for that (cutting the first 0.1sec)?
(changing 129 sound files by hand may be although a solution but if s.o. knows a better way ... )
Klaus
first of all a big thumpup and thank to Caffeine for creating this tool.
I simply have a problem with numbers (maybe there's a simple solution I just fail to see):
Every generated sound file starts with about 0.1 sec of silence. For a single file that's no problem
but composite numbers like e.g. 1234 sound a bit chopped/scattered ...
Does anyone have a simple solution for that (cutting the first 0.1sec)?
(changing 129 sound files by hand may be although a solution but if s.o. knows a better way ... )
Klaus
( TH9X / 9XTreme / FrSky DHT / Spektrum / Multi / RotEnc )
( 9XR PRO / erSKY9x / FrSky DHT / Spektrum / Multi / RotEnc / P3 as Slider )
( 9XR PRO / erSKY9x / FrSky DHT / Spektrum / Multi / RotEnc / P3 as Slider )
Re: A tool to generate voice files using various TTS providers
Caffeine, just another idea.. What about a button "Go to voice folder" or similar. It would open the main folder where the files are using windows explorer.. What do you think?
João
João
My er9x/Ersky9x/eepskye Video Tutorials
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
Re: A tool to generate voice files using various TTS providers
I think the biggest delay is the radio selecting the files, not the files themselves??KAL wrote:Hi folks,
first of all a big thumpup and thank to Caffeine for creating this tool.
I simply have a problem with numbers (maybe there's a simple solution I just fail to see):
Every generated sound file starts with about 0.1 sec of silence. For a single file that's no problem
but composite numbers like e.g. 1234 sound a bit chopped/scattered ...
Does anyone have a simple solution for that (cutting the first 0.1sec)?
(changing 129 sound files by hand may be although a solution but if s.o. knows a better way ... )
Klaus
Hmmm, ok, but maybe the 0.3 seconds at the end of the file is worse than the 0.1 at the beginning.. Add them together and you have half a second delay between 2 consecutive files I can see what you mean..
João
My er9x/Ersky9x/eepskye Video Tutorials
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
- KAL
- Posts: 552
- Joined: Thu Oct 10, 2013 4:36 pm
- Country: Germany
- Location: somewhere near Rothenburg ob der Tauber
Re: A tool to generate voice files using various TTS providers
In the meantime I used audacity to cut silence by a batch job but this cuts every little piece of silence.
That also doesn't sound good ...
That also doesn't sound good ...
( TH9X / 9XTreme / FrSky DHT / Spektrum / Multi / RotEnc )
( 9XR PRO / erSKY9x / FrSky DHT / Spektrum / Multi / RotEnc / P3 as Slider )
( 9XR PRO / erSKY9x / FrSky DHT / Spektrum / Multi / RotEnc / P3 as Slider )
Re: A tool to generate voice files using various TTS providers
I think You can configure audacity to cut only if the silence is longer than "n"..
João
João
My er9x/Ersky9x/eepskye Video Tutorials
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
Re: A tool to generate voice files using various TTS providers
Hi Klaus,KAL wrote:I simply have a problem with numbers (maybe there's a simple solution I just fail to see):
Every generated sound file starts with about 0.1 sec of silence. For a single file that's no problem
but composite numbers like e.g. 1234 sound a bit chopped/scattered ...
Klaus
Is this with all of the TTS engines, or only one (Ivona, Google etc)?
That's a good idea! I'll look at doing that. thanksjhsa wrote:Caffeine, just another idea.. What about a button "Go to voice folder" or similar. It would open the main folder where the files are using windows explorer.. What do you think?
João
Release 2.8.2.0
- Add functionality to open a recent phrase file
- Add functionality to open the output directory in Windows Explorer
Re: A tool to generate voice files using various TTS providers
A couple things that should probably be changed?
I have to uninstall the previous version before installing the new one? Couldn't that be done automatically?
I also lose all my settings because when uninstalling the previous version, the folder is erased, and the settings file is also erased with it
João
I have to uninstall the previous version before installing the new one? Couldn't that be done automatically?
I also lose all my settings because when uninstalling the previous version, the folder is erased, and the settings file is also erased with it
João
My er9x/Ersky9x/eepskye Video Tutorials
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
Re: A tool to generate voice files using various TTS providers
could you please send me the PT language file with the changes for this version? Or did you add just that output directory line to the previous one?? If it was only that line then you should replace "directório de saída aberta no windows explorer" with "Abrir Pasta no Windows Explorer"
Thanks
Thanks
My er9x/Ersky9x/eepskye Video Tutorials
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
Re: A tool to generate voice files using various TTS providers
Also,
Thanks
João
EDIT: opening the folder is really useful, thanks
Thanks
João
EDIT: opening the folder is really useful, thanks
My er9x/Ersky9x/eepskye Video Tutorials
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
Re: A tool to generate voice files using various TTS providers
Thanks João, I'll investigate.
- KAL
- Posts: 552
- Joined: Thu Oct 10, 2013 4:36 pm
- Country: Germany
- Location: somewhere near Rothenburg ob der Tauber
Re: A tool to generate voice files using various TTS providers
Caffeine wrote:Hi Klaus,
Is this with all of the TTS engines, or only one (Ivona, Google etc)?
- ivona:...........................delay at start seems to be about 0,10s - delay at end about 0,30s
- google:.........................delay at start seems to be about 0,05s - delay at end about 0,15s
- microsoft:.....................delay at start seems to be about 0,15s - delay at end about 0,70s
- Bing text to speech:.........isn't working on my tablet
- From text to speech:........isn't working on my tablet
( TH9X / 9XTreme / FrSky DHT / Spektrum / Multi / RotEnc )
( 9XR PRO / erSKY9x / FrSky DHT / Spektrum / Multi / RotEnc / P3 as Slider )
( 9XR PRO / erSKY9x / FrSky DHT / Spektrum / Multi / RotEnc / P3 as Slider )
- KAL
- Posts: 552
- Joined: Thu Oct 10, 2013 4:36 pm
- Country: Germany
- Location: somewhere near Rothenburg ob der Tauber
Re: A tool to generate voice files using various TTS providers
Found parameters for audacity's "truncate silence" that give results I can live with
Klaus
Klaus
( TH9X / 9XTreme / FrSky DHT / Spektrum / Multi / RotEnc )
( 9XR PRO / erSKY9x / FrSky DHT / Spektrum / Multi / RotEnc / P3 as Slider )
( 9XR PRO / erSKY9x / FrSky DHT / Spektrum / Multi / RotEnc / P3 as Slider )
Re: A tool to generate voice files using various TTS providers
Hi João, I've updated the 2.8.2.0 release with your translations.
I'm still investigating how to retain the settings when a new version is installed.
I'm still investigating how to retain the settings when a new version is installed.
Re: A tool to generate voice files using various TTS providers
Thank you..
João
João
My er9x/Ersky9x/eepskye Video Tutorials
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
Re: A tool to generate voice files using various TTS providers
I'm implementing a wav file editor, that will allow you to modify voice files after they are created (including removing silence)
Would a mechanism to detect and remove silence automatically be helpful?
Would a mechanism to detect and remove silence automatically be helpful?
Re: A tool to generate voice files using various TTS providers
wooow, you never stop surprising us, do you??
I thought the tool was already perfect But there you are with a super idea Thank you.
Yes, remove silence automatically and manually??
Adjust Gain/normalize the files?
Equalization? With preview? Different radios with different speakers
Batch process the complete folder(s)?
Thanks
João
I thought the tool was already perfect But there you are with a super idea Thank you.
Yes, remove silence automatically and manually??
Adjust Gain/normalize the files?
Equalization? With preview? Different radios with different speakers
Batch process the complete folder(s)?
Thanks
João
My er9x/Ersky9x/eepskye Video Tutorials
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
Re: A tool to generate voice files using various TTS providers
All good ideas
The library I'm using does have a built in compressor (to compress the dynamic range of the audio, rather than compress the digital file) but in the words of the author:
The library I'm using does have a built in compressor (to compress the dynamic range of the audio, rather than compress the digital file) but in the words of the author:
I haven't figured out how to use it yetto be honest, I need to make this a bit easier to work with
Re: A tool to generate voice files using various TTS providers
yeah, compressors are not very easy to work with, at least for me
Try to play with it in Audacity..
João
Try to play with it in Audacity..
João
My er9x/Ersky9x/eepskye Video Tutorials
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
Re: A tool to generate voice files using various TTS providers
I'm finding visualising the waveforms very interesting.
For example, here's Ivona's Nicole pronouncing 'Battery Capacity Warning' at XLoud and XSoft volume levels:
I'm looking at whether I can implement a relative volume scale, perhaps allow you to visualise all of the waveforms in a folder and identify if any need adjustment.
For example, here's Ivona's Nicole pronouncing 'Battery Capacity Warning' at XLoud and XSoft volume levels:
I'm looking at whether I can implement a relative volume scale, perhaps allow you to visualise all of the waveforms in a folder and identify if any need adjustment.
Re: A tool to generate voice files using various TTS providers
Normally audio programs have a db scale in the wave editor.
I do think everybody will try to max the volume as loud as they can without distortion.
The files should all be normalized so they all have more or less the same volume. Visualizing the wav would certainly help
João
I do think everybody will try to max the volume as loud as they can without distortion.
The files should all be normalized so they all have more or less the same volume. Visualizing the wav would certainly help
João
My er9x/Ersky9x/eepskye Video Tutorials
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW
https://www.youtube.com/playlist?list=PL5uJhoD7sAKidZmkhMpYpp_qcuIqJXhb9
Donate to Er9x/Ersky9x:
https://www.paypal.com/cgi-bin/webscr?cmd=_s-xclick&hosted_button_id=YHX43JR3J7XGW