Original topic:

Subtitle videos with the help of speech recognition technology

(Topic created on: 11-05-2022 04:31 AM)
197 Views
opphawks
Active Level 3
Others

Subtitling online videos is a growing practice. It is no accident that YouTube offers automatic subtitling, even though the result is not 100% accurate, and that is precisely the point we want to focus on.

Add to this the trend toward generating automatic subtitles, and it is more than evident why many companies and media outlets are investing in subtitling videos. But let's not go into more detail about this for now.

Let's return to our focus: to obtain subtitles we can use voice recognition technology. It is a tool that automates and speeds up the manual work of transcription so that the text can later be turned into subtitles. In other words, it converts audio or video to text, albeit with some limitations. How does it work?

Voice recognition technology

We concentrate on points 1 and 2. The limits of voice recognition technology include, for example, its inaccuracy in recognizing certain words or ways of speaking (slang, accents...), in transcribing what is said by more than one speaker, and in placing all the punctuation marks. It is also affected by environmental factors: if the ambient sound is very loud, the recognition will be poor.

The good thing is that these flaws are constantly being corrected. The companies that offer this type of service make it a priority to develop their platforms so that they become increasingly intelligent. Siri, for example, already understands us better than it used to.

Free and paid tools to get subtitles



We have options like YouTube, which is not bad at all because it lets us obtain automatic subtitles or upload our own. It is therefore a convenient choice if certain transcription errors do not bother you and the videos will only be published on that platform.

Other tools are those built for dictation, such as Google Docs, Dictation or Speechnotes. As their names indicate, they are not exactly designed to produce subtitles with timecodes, but many people use them because they are free.

With a bit of inventiveness, many users activate the microphone on these platforms and immediately start playing the video so that it “dictates” what should be transcribed. Does it work? If the audio is good, the result will also be good.

The catch with these dictation tools is that the process is slow. The transcription happens in real time as the video plays, so if the video lasts an hour, you will have to wait an hour for the platform to transcribe it. Also, you won't be able to export the text in .srt format, which means you have to complete the subtitle sync manually.
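To give an idea of what that manual step produces, here is a minimal Python sketch that turns timed text segments into .srt content. The function names (`format_timestamp`, `to_srt`) and the sample segments are made up for illustration; the start and end times are assumed to have been worked out by hand, since dictation tools do not supply them.

```python
def format_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp: HH:MM:SS,mmm."""
    total_ms = round(seconds * 1000)
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, millis = divmod(rest, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{millis:03d}"


def to_srt(segments) -> str:
    """Build SRT text from (start_seconds, end_seconds, text) tuples."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        # Each cue: sequence number, time range, then the subtitle text.
        time_range = f"{format_timestamp(start)} --> {format_timestamp(end)}"
        blocks.append(f"{index}\n{time_range}\n{text.strip()}")
    # Cues are separated by a blank line.
    return "\n\n".join(blocks) + "\n"


# Hypothetical segments, timed manually against the video.
segments = [
    (0.0, 2.5, "Welcome to the video."),
    (2.5, 6.0, "Today we talk about subtitles."),
]

if __name__ == "__main__":
    print(to_srt(segments))
```

Saving the returned string to a file with the .srt extension gives something most players and video platforms can load, although the timings are only as good as the manual sync behind them.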

2 Comments
Murtazamemon
Active Level 7
Others
I'm sorry, but why did you post this here?
0 Likes
martinlinda43
Active Level 1
Others

Meta Kuwait Portal is a central platform of Kuwait to manage government appointments for all residents. The residents of Kuwait often need to visit.

0 Likes