Speech analysis and re-synthesis
Speech analysis and re-synthesis
Here is some fun. In this schematic I use La Voz Cantante as an engine for speech synthesis. A recorded piece of spoken (or sung) text is reproduced with full and independent control over pitch, pitch variation, formant shift and speed/duration. My use of it is mainly to analyze the synthesized material in super slow motion in order to test and improve my own algos, but you can also use it just for fun!
- Attachments
- SayItDifferent2.fsm
- (558.67 KiB) Downloaded 1117 times
-
martinvicanek - Posts: 1328
- Joined: Sat Jun 22, 2013 8:28 pm
Re: Speech analysis and re-synthesis
Sounds like a pretty genius idea. I'll try and figure it out.
-
wlangfor@uoguelph.ca - Posts: 912
- Joined: Tue Apr 03, 2018 5:50 pm
- Location: North Bay, Ontario, Canada
Re: Speech analysis and re-synthesis
That's really cool, Martin. Great work.
Something interesting has caught my eye: the timing. That's something I've been looking for. Would it be possible to sync the timing of any given sample to a given BPM? In other words, how can we translate the timing values in your schematic to BPM values? It would open up many possibilities for producing "time-synced sample" based toys.
Thanks!
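
For anyone who wants to try the BPM idea, here is a rough sketch of the arithmetic in Python. It assumes the speed control can be treated as a plain multiplier (1.0 plays the sample at its recorded length, 2.0 plays it in half the time); that is an assumption, not something confirmed from the schematic, and `speed_for_bpm` is just a hypothetical helper name.

```python
def speed_for_bpm(sample_seconds: float, bpm: float, beats: float) -> float:
    """Return the playback speed factor that makes a sample last `beats` beats at `bpm`.

    Assumption: speed is a simple multiplier, so new_duration = sample_seconds / speed.
    """
    if sample_seconds <= 0 or bpm <= 0 or beats <= 0:
        raise ValueError("sample_seconds, bpm and beats must all be positive")
    seconds_per_beat = 60.0 / bpm          # one beat at the given tempo
    target_seconds = beats * seconds_per_beat
    return sample_seconds / target_seconds


# A 3-second phrase that should fill 4 beats at 120 BPM (i.e. 2 seconds)
speed = speed_for_bpm(sample_seconds=3.0, bpm=120.0, beats=4.0)
```

In this example the phrase has to fit into 2 seconds, so the speed factor comes out as 1.5. If the schematic's timing control uses a different scale, the result would need to be mapped onto that scale.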
-
kortezzzz - Posts: 763
- Joined: Tue Mar 19, 2013 4:21 pm
Re: Speech analysis and re-synthesis
Unbelievable!
Great fun to play with but the really impressive thing is the quality of the result. It sounds incredibly real to me over a huge range of parameter values.
You never cease to surprise and amaze me, Martin.
Cheers
Spogg
-
Spogg - Posts: 3358
- Joined: Thu Nov 20, 2014 4:24 pm
- Location: Birmingham, England
Re: Speech analysis and re-synthesis
martinvicanek wrote: My use of it is mainly to analyze the synthesized material in super slow motion in order to test and improve my own algos, but you can also use it just for fun!
After playing around for a while, I wonder whether there might be uses in speech therapy and language learning for a system like this. I noted in particular that the pitch variation control does a very good job of enhancing, removing, or inverting prosodic cues - for example, the phrase in the included sample seems to change from being a statement to being a question when the pitch variation is inverted.
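
As a rough illustration of the idea (not necessarily how the schematic does it), a pitch variation control of this kind can be thought of as scaling each pitch value's distance from the average pitch, measured in octaves. The function name, the `amount` parameter and the convention that 0 Hz marks an unvoiced frame are all assumptions made for this sketch.

```python
import math


def scale_pitch_variation(f0_hz, amount):
    """Scale a pitch contour's deviation from its average pitch.

    f0_hz  : list of pitch values in Hz, with 0 meaning "unvoiced" (assumed convention)
    amount : 1.0 leaves the contour unchanged, 0.0 flattens it to a monotone,
             -1.0 mirrors it around the average, values above 1.0 exaggerate it
    """
    voiced = [f for f in f0_hz if f > 0]
    if not voiced:
        return list(f0_hz)
    # Average in the log domain so that rises and falls are treated as musical intervals
    mean_log = sum(math.log2(f) for f in voiced) / len(voiced)
    return [
        2.0 ** (mean_log + amount * (math.log2(f) - mean_log)) if f > 0 else 0.0
        for f in f0_hz
    ]


# A contour that rises by an octave, with one unvoiced frame in the middle
rising = [100.0, 0.0, 200.0]
inverted = scale_pitch_variation(rising, -1.0)   # now falls by an octave
monotone = scale_pitch_variation(rising, 0.0)    # both voiced frames at the same pitch
```

With `amount` set to -1.0 a rising contour becomes a falling one, which is the kind of change that can turn a question-like intonation into a statement-like one, or the reverse.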
Such manipulation and/or analysis of prosodic cues, maybe combined with visual feedback, might be a useful tool to supplement sessions with a speech therapist for improving the perception or production of fluent prosody - often found difficult by autistic people, folks with various hearing impairments, aphasias, etc. Likewise, I imagine it could have uses as an aid for learning pronunciation of tonal languages (e.g. Mandarin, Thai, or Vietnamese) for learners whose first language is non-tonal.
As a little experiment, I tried it on some recordings of my speech. My prosody is often noted as being very flat by other people (including formally at my Asperger's Syndrome diagnosis), though it doesn't sound that way to me "inside my head" when I'm speaking. Of course, it's hardly a scientific, blinded experiment; but it was interesting to find that exaggerating the pitch variation does indeed seem to make my voice sound more "typical" of what I hear in other people's voices - yet the excellent quality of the processing is such that it remains recognisable as my voice rather than a different speaker.
I had a great time using it "just for fun" too, of course. But, as ever, I think you are too modest; tools like these, in the right hands, may have the potential to be much more than just "toys" or DSP coding aids!
All schematics/modules I post are free for all to use - but a credit is always polite!
Don't stagnate, mutate to create!
-
trogluddite - Posts: 1730
- Joined: Fri Oct 22, 2010 12:46 am
- Location: Yorkshire, UK
Re: Speech analysis and re-synthesis
Brilliant idea trog!
I’ve been wondering about using several of these to create multi-tracked harmony singing from a single voice.
But I haven’t tested that out yet.
I think a lot could be done with this wonderful tool.
Cheers
Spogg
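
For the harmony idea, the pitch shift each copy would need follows from the equal-tempered semitone ratio. A minimal sketch in Python, assuming each copy's pitch control accepts a frequency ratio (an assumption about the schematic; `harmony_ratios` is a hypothetical helper):

```python
def harmony_ratios(semitone_offsets):
    """Convert semitone offsets into frequency ratios (12-tone equal temperament)."""
    return [2.0 ** (s / 12.0) for s in semitone_offsets]


# Three copies of one voice: the original, a major third above, a perfect fifth above
triad = harmony_ratios([0, 4, 7])
```

Each ratio would then be applied to one copy of the voice, with the formant shift left alone so that the copies still sound like the same singer.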
-
Spogg - Posts: 3358
- Joined: Thu Nov 20, 2014 4:24 pm
- Location: Birmingham, England
Re: Speech analysis and re-synthesis
This demo I did uses a single voice (the ugly one that you hear at the beginning) as input.
https://vicanek.de/audioprocessing/imag ... s_demo.mp3
-
martinvicanek - Posts: 1328
- Joined: Sat Jun 22, 2013 8:28 pm
Re: Speech analysis and re-synthesis
Hi Martin,
Really like this a super lot! Great for speech therapy, special effects, voice-over dubbing (multitracking), and so on.
Does anyone besides me notice some kind of clicking noise even when noise is set to zero, or is it just my computer, DAW, me, etc.?
Later then, BobF.....
-
BobF - Posts: 598
- Joined: Mon Apr 20, 2015 9:54 pm
Re: Speech analysis and re-synthesis
Thanks guys. I'll show it to my wife who is a foreign languages teacher.
-
martinvicanek - Posts: 1328
- Joined: Sat Jun 22, 2013 8:28 pm
Re: Speech analysis and re-synthesis
martinvicanek wrote: This demo I did uses a single voice (the ugly one that you hear at the beginning) as input.
https://vicanek.de/audioprocessing/imag ... s_demo.mp3
Wonderful!
This is surely a commercial product...
Cheers
Spogg
-
Spogg - Posts: 3358
- Joined: Thu Nov 20, 2014 4:24 pm
- Location: Birmingham, England