journal article Jan 01, 2024

Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning

DOI: 10.1109/taslp.2024.3492793
Cited By
19
An End-to-End Overview of Clinical Speech AI

Si-Ioi Ng, Lingfeng Xu · 2026

IEEE Transactions on Audio, Speech...
IEEE Transactions on Information Fo...
Metrics
Citations: 19
References: 404
Details
Published: Jan 01, 2024
Vol/Issue: 32
Pages: 4971-4998
Funding
Shenzhen Science and Technology Program Award: ZDSYS20230626091302006
Shanghai Municipal Science and Technology Commission Project Award: 2021SHZDZX0102
Shenzhen Science and Technology Research Fund Award: JCYJ20220818103001002
China NSFC projects Award: 62401377
Internal Project of Shenzhen Research Institute of Big Data Award: T00120220002
Cite This Article
Shuai Wang, Zhengyang Chen, Kong Aik Lee, et al. (2024). Overview of Speaker Modeling and Its Applications: From the Lens of Deep Speaker Representation Learning. IEEE/ACM Transactions on Audio, Speech, and Language Processing, 32, 4971-4998. https://doi.org/10.1109/taslp.2024.3492793