Underjord is a tiny, wholesome team doing Elixir consulting and contract work. If             you             like             the writin

Voice Activity Detection in Elixir and Membrane

submited by
Style Pass
2024-11-27 08:00:04

Underjord is a tiny, wholesome team doing Elixir consulting and contract work. If you like the writing you should really try the code. See our services for more information.

I hacked on something quite useful in the last few weeks, off and on. Voice Activity Detection in Elixir with Silero VAD through ONNX. I’ll show what I did and try to give an idea of what it is and why it is useful.

It boiled down to this gist as a proof of concept. Should work on most Elixir installs. These days Membrane will even try to pull pre-compiled dependencies for the libraries it wants. This was pleasant news as it can otherwise be a hassle to pull the right shared libraries for media processing. I had completely missed that this was added.

Using Silero VAD from Elixir is not something I discovered or was first to write about. I leaned on this post by Sean Moriarty for DockYard. Finding the right version of Silero VAD to match Ortex was a bit of a hassle because the model has changed significantly and is out of step with Ortex, the Elixir library based on the Rust library Ort which provides and ONNX runtime. Got some good help in the Erlang Ecosystem Foundation’s Slack, the #machine-learning channel is where that stuff happens. Shout-out and thanks to Travis Morton and Andrés Alejos.

Leave a Comment