Missed a bunch of these reviews so will refrain from being elaborate this time. I am also starting to add a timestamp in every capture from now on so I can clearly say which things happened in last week and which earlier.

1 Experiments

  • I have been thinking about a more programmer friendly API for emacs-speech-input as I don't really like the boring FSM-ish conversational framework. Looking at the whole process as a probabilistic parser running on audio streams can work out. Plus you, probably, get to design flows using something like parser combinators.

2 Readings/Explorations

3 Programming

Commits for week 46-2019 and 4 previous weeks.

4 Media

Bibliography

  • [wang2018speaker] Wang, Downey, Wan, Mansfield & Moreno. 2018. "Speaker diarization with lstm", 5239-5243, in in: 2018 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), edited by
  • [zhang2018fully] "Aonan Zhang, Quan Wang, Zhenyao Zhu, John Paisley & Chong Wang". 2018. "Fully Supervised Speaker Diarization." "arXiv preprint arXiv:1810.04719", , link. doi.
  • [xie2019utterance] Xie, Nagrani, Chung & Zisserman. 2019. "Utterance-level Aggregation for Speaker Recognition in the Wild", 5791-5795, in in: ICASSP 2019-2019 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), edited by