AI@lemmy.ml · 1 year ago

[voice recognition] Audio tools for generating datasets?

1

4

[voice recognition] Audio tools for generating datasets?

AI@lemmy.ml · 1 year ago

1

This is more of personal project to learn more about how speech recognition (SR) works and how AI training works at a low level. (Functionally, it’s pointless and is just a self-assigned “homework problem”)

To do this, I need to record a bit of audio to to use as training data.

Recording and chopping up .wav files is easy, but it’s time consuming. I am toying with my own teleprompter-like python app that will prompt for a word, record and tag, and save for later. However, is there a good app to automatically create utterances that is already built?

Ideally, unrecognized words in my own SR system would be automatically turned into tagged audio clips to be used for re-training or fine tuning.

I am shortcutting a bit of this work in python with Google SR for my first dataset. Unfortunately, calling external APIs is sidestepping my intent of this project so I’ll move away from that soon.

People that work with AI typically work with lots of data, so I figured here was a good place to ask.

You must log in or # to comment.

Chat

remoteloveOP
link
fedilink
arrow-up
1·
1 year ago
I found this as a start: https://github.com/cmusphinx/pocketsphinx/blob/master/cython/pocketsphinx/segmenter.py

AI@lemmy.ml

artificial_intel@lemmy.ml

You are not logged in. However you can subscribe from another Fediverse account, for example Lemmy or Mastodon. To do this, paste the following into the search field of your instance: [email protected]

Artificial intelligence (AI) is intelligence demonstrated by machines, unlike the natural intelligence displayed by humans and animals, which involves consciousness and emotionality. The distinction between the former and the latter categories is often revealed by the acronym chosen.

Visibility: Public

This community can be federated to other instances and be posted/commented in by their users.

7 users / day
48 users / week
741 users / month
1.6K users / 6 months
94 local subscribers
5.17K subscribers
590 Posts
1.77K Comments
Modlog

mods: