How can I get the data indices (sample positions) where speech starts or ends, instead of the times at which it starts or ends?
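To illustrate the kind of result I am after, here is a minimal sketch of the conversion, assuming the detector reports times in seconds and the sampling rate of the audio is known. The function name `time_to_sample_index`, the segment values, and the 16 kHz rate are made up for illustration and do not come from any particular library.

```python
def time_to_sample_index(t_seconds: float, sample_rate: int) -> int:
    """Convert a timestamp in seconds to the nearest sample index."""
    return int(round(t_seconds * sample_rate))


# Hypothetical speech segments as (start, end) times in seconds.
segments_sec = [(0.5, 1.25), (2.0, 3.5)]
sample_rate = 16000  # assumed sampling rate in Hz

# Map every (start, end) time pair to a (start, end) index pair.
segments_idx = [
    (time_to_sample_index(start, sample_rate),
     time_to_sample_index(end, sample_rate))
    for start, end in segments_sec
]
print(segments_idx)  # [(8000, 20000), (32000, 56000)]
```

With indices like these, a segment could be sliced straight out of the sample array, e.g. `samples[8000:20000]`.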