
Speech recognition for dictation that works on Sailfish?

asked 2020-02-02 20:13:47 +0300

cemoi71

Hello Sailors,

I would like to use speech recognition for dictation.
That is, I would like to speak in order to create text.
I am looking for software like Dragon NaturallySpeaking from Nuance.

It should recognize French and German for me (for work), and maybe English too.
Ideally, it should work offline and directly.
An online solution could be interesting, but only as a second choice.

It doesn't matter whether it is native Sailfish or Android.

Does anyone have any experience with this?

Any help would be very welcome. Thanks in advance for your effort.

Have a nice sail


1 Answer


answered 2020-02-03 04:28:16 +0300

DrYak

updated 2020-02-03 11:38:03 +0300

Currently, few speech recognition systems do not rely on some kind of neural-network AI (e.g. LSTM).

Sphinx has a mobile-friendly system called PocketSphinx. It uses a simpler machine-learning approach that fits in a mobile setting and can be trained on a limited subset of the language corpus.

(For other mobile-friendly options, see the end of this answer.)

The rest are all based on neural networks, which are much more resource-intensive.

  • most systems (including the mobile version of Dragon Dictate) just stream their audio input to some cloud server (Google is another example, and on OpenRepos you can find Saera for Sailfish, which is also cloud-based).

  • a few of the very latest high-end phones have built-in accelerators that help with the kind of matrix maths used in neural nets. Sadly, none of them currently run Sailfish OS.

  • the last alternative would be to run your own server (either on computers with GPUs, or on a special SBC that pairs an ARM CPU with an AI accelerator; see RK1808-based AI modules such as Pine64's SoEdge).

  • otherwise, the fallback is one of the mobile-friendly systems (see below).
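
To illustrate the "stream audio to a server" approach from the list above, here is a minimal sketch of the client-side step only: cutting a recording into short PCM chunks that could then be sent to a recognizer. The function names and the 100 ms chunk size are assumptions chosen for illustration, not taken from any of the products mentioned, and no network code is included.

```python
import io
import wave
from typing import Iterator


def pcm_chunks(wav_file, chunk_ms: int = 100) -> Iterator[bytes]:
    """Yield raw PCM chunks of roughly chunk_ms milliseconds from a WAV file.

    wav_file may be a path or a binary file-like object.
    The last chunk may be shorter than the others.
    """
    with wave.open(wav_file, "rb") as wav:
        frames_per_chunk = max(1, wav.getframerate() * chunk_ms // 1000)
        while True:
            data = wav.readframes(frames_per_chunk)
            if not data:
                break
            yield data


def make_silent_wav(n_frames: int, rate: int = 16000) -> io.BytesIO:
    """Build an in-memory mono 16-bit WAV of silence (demo input only)."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wav:
        wav.setnchannels(1)   # mono
        wav.setsampwidth(2)   # 16-bit samples
        wav.setframerate(rate)
        wav.writeframes(b"\x00\x00" * n_frames)
    buf.seek(0)
    return buf


if __name__ == "__main__":
    # One second of silence at 16 kHz, cut into 100 ms chunks.
    for i, chunk in enumerate(pcm_chunks(make_silent_wav(16000))):
        print(i, len(chunk), "bytes")
```

In a real client, each chunk would be handed to whatever transport the chosen server expects; that part depends entirely on the service and is left out here.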

There are also open-source neural-network-based technologies:

  • Mozilla has been working on open-source technology (DeepSpeech) and recently introduced a TFLite version that can also run on ARM.
  • Mycroft also has open-source code that can run on ARM. For the speech-to-text part it is partnering with Mozilla (and in the past it has used PocketSphinx for wake-word recognition).
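
Whichever offline engine is tried, a common stumbling block is feeding it audio in the wrong format. The sketch below checks a WAV file before handing it to a recognizer. It assumes the model wants 16 kHz, mono, 16-bit PCM, which is common for published acoustic models but should be verified for the model actually used. The `recognizer` object with an `stt(samples)` method is a hypothetical interface for illustration, not a specific library's API; real bindings may expect a different buffer type.

```python
import array
import sys
import wave

# Assumed input format; check the documentation of the model you use.
EXPECTED_RATE = 16000      # samples per second
EXPECTED_CHANNELS = 1      # mono
EXPECTED_WIDTH = 2         # bytes per sample (16-bit)


def load_samples(wav_file) -> array.array:
    """Read a WAV file and return its samples as signed 16-bit integers.

    Raises ValueError if the file does not match the assumed format.
    """
    with wave.open(wav_file, "rb") as wav:
        problems = []
        if wav.getframerate() != EXPECTED_RATE:
            problems.append(f"sample rate is {wav.getframerate()} Hz")
        if wav.getnchannels() != EXPECTED_CHANNELS:
            problems.append(f"{wav.getnchannels()} channels")
        if wav.getsampwidth() != EXPECTED_WIDTH:
            problems.append(f"{wav.getsampwidth() * 8}-bit samples")
        if problems:
            raise ValueError("unsupported audio: " + ", ".join(problems))
        raw = wav.readframes(wav.getnframes())

    samples = array.array("h")
    samples.frombytes(raw)
    if sys.byteorder == "big":
        samples.byteswap()  # WAV data is little-endian
    return samples


def transcribe(recognizer, wav_file) -> str:
    """Run a recognizer (hypothetical: any object with stt(samples)) on a file."""
    return recognizer.stt(load_samples(wav_file))
```

Recordings in another format would need resampling or down-mixing first, for example with a tool such as SoX or ffmpeg.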

There are also cloud providers that let you spin up a closed-source cloud instance dedicated to this:

  • I think I've seen it offered on Microsoft Azure (you can get your own speech cloud VM), but I don't have much experience with it (professionally, we're a research group that uses more classical statistical analysis, with little neural-network work at the moment).

Comments

Mozilla's DeepSpeech has a TFLite version for ARM; answer updated accordingly.

DrYak ( 2020-02-03 11:39:06 +0300 )

Saera uses Julius, so, like PocketSphinx, it works offline and on-device AFAIK (though it seems to have been broken since 2018 or so, based on the comments).

szopin ( 2020-02-03 16:04:01 +0300 )

Thank you both very much! Has anyone tested it in an SFOS environment?

cemoi71 ( 2020-02-03 23:19:40 +0300 )