Associate Professor Vidhyasaharan Sethu

Associate Professor

PhD (UNSW), MEngSc in Signal Processing (UNSW), BE in Electronics and Communication Engineering (Anna University)

Engineering

Electrical Engineering and Telecommunications

Vidhyasaharan Sethu is an Associate Professor with the School of Electrical Engineering and Telecommunications. His primary research interests are in the field of speech signal processing. Particularly in the application of machine learning techniques for addressing speech processing tasks. His research interests include speech based emotion and mental state recognition systems, affective computing, voice biometrics and more broadly the overlap between machine learning and signal processing.

Phone

+61 2 9385 7737

E-mail

v.sethu@unsw.edu.au

Location

Room 442, EE&T Building (G17), UNSW Sydney

Book Chapters | 2015

Sethu V; Epps J; Ambikairajah E, 2015, 'Speech based emotion recognition', in Speech and Audio Processing for Coding, Enhancement and Recognition, Springer Link, pp. 197 - 228, http://dx.doi.org/10.1007/978-1-4939-1456-2_7

Book Chapters | 2014

Ambikairajah E; Sethu V; Eaton R; Sheng M, 2014, 'Evolving use of educational technologies: Enhancing lectures', in Using Technology Tools to Innovate Assessment Reporting and Teaching Practices in Engineering Education, pp. 241 - 258, http://dx.doi.org/10.4018/978-1-4666-5011-4.ch018
Journal articles | 2026

Meng H; Breebaart J; Stoddard J; Sethu V; Ambikairajah E, 2026, 'A unified deep learning framework for estimating acoustic context parameters from first order ambisonic speech recordings', Journal on Audio, Speech, and Music Processing, http://dx.doi.org/10.1186/s13636-025-00443-0

Journal articles | 2025

Charls D; Sethu V; Ahmed B, 2025, 'Uncertainty-Aware Domain Adaptation for ECG Classification', Proceedings of the Annual International Conference of the IEEE Engineering in Medicine and Biology Society EMBS, http://dx.doi.org/10.1109/EMBC58623.2025.11254147

Journal articles | 2025

Jing M; Sethu V; Ahmed B; Lee KA, 2025, 'Quantifying prediction uncertainties in automatic speaker verification systems', Computer Speech and Language, 94, http://dx.doi.org/10.1016/j.csl.2025.101806

Journal articles | 2025

Wu J; Dang T; Sethu V; Ambikairajah E, 2025, 'How many raters do we need? Analyses of uncertainty in estimating ambiguity-aware emotion labels', IEEE Transactions on Affective Computing, http://dx.doi.org/10.1109/TAFFC.2025.3616071

Journal articles | 2025

Zhang Q; Wickramasinghe B; Ambikairajah E; Sethu V; Li H, 2025, 'Should Audio Front-Ends be Adaptive? Comparing Learnable and Adaptive Front-Ends', IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 33, pp. 998 - 1010, http://dx.doi.org/10.1109/TASLPRO.2025.3542281

Journal articles | 2024

Bose D; Sethu V; Ambikairajah E, 2024, 'Continuous Emotion Ambiguity Prediction: Modeling with Beta Distributions', IEEE Transactions on Affective Computing, 15, pp. 1684 - 1695, http://dx.doi.org/10.1109/TAFFC.2024.3367371

Journal articles | 2024

Haghshenas Y; Wong WP; Gunawan D; Khataee A; Keyikoğlu R; Razmjou A; Kumar PV; Toe CY; Masood H; Amal R; Sethu V; Teoh WY, 2024, 'Predicting the rates of photocatalytic hydrogen evolution over cocatalyst-deposited TiO2 using machine learning with active photon flux as a unifying feature', Ees Catalysis, 2, pp. 612 - 623, http://dx.doi.org/10.1039/d3ey00246b

Journal articles | 2024

Haghshenas Y; Wong WP; Sethu V; Amal R; Kumar PV; Teoh WY, 2024, 'Full prediction of band potentials in semiconductor materials', Materials Today Physics, 46, pp. 101519, http://dx.doi.org/10.1016/j.mtphys.2024.101519

Journal articles | 2024

Hong X; Gong Y; Sethu V; Dang T, 2024, 'AER-LLM: Ambiguity-aware Emotion Recognition Leveraging Large Language Models.', CoRR, abs/2409.18339

Journal articles | 2024

Meng H; Zhang Q; Zhang X; Sethu V; Ambikairajah E, 2024, 'Binaural Selective Attention Model for Target Speaker Extraction', Interspeech 2024, pp. 4323 - 4327, http://dx.doi.org/10.21437/interspeech.2024-683

Journal articles | 2024

Nan Z; Dang T; Sethu V; Ahmed B, 2024, 'A Joint Spectro-Temporal Relational Thinking Based Acoustic Modeling Framework.', CoRR, abs/2409.15357

Journal articles | 2024

Nan Z; Dang T; Sethu V; Ahmed B, 2024, 'Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling', ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 6495 - 6499, http://dx.doi.org/10.1109/icassp48485.2024.10447530

Journal articles | 2024

Wu J; Dang T; Sethu V; Ambikairajah E, 2024, 'Dual-Constrained Dynamical Neural ODEs for Ambiguity-aware Continuous Emotion Prediction', Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 3185 - 3189, http://dx.doi.org/10.21437/Interspeech.2024-119

Journal articles | 2023

Dimitriadis A; Pan S; Sethu V; Ahmed B, 2023, 'Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio.', CoRR, abs/2310.10922

Journal articles | 2023

Masood H; Sirojan T; Toe CY; Kumar PV; Haghshenas Y; Sit PHL; Amal R; Sethu V; Teoh WY, 2023, 'Enhancing prediction accuracy of physical band gaps in semiconductor materials', Cell Reports Physical Science, 4, http://dx.doi.org/10.1016/j.xcrp.2023.101555

Journal articles | 2023

Wickramasinghe B; Ambikairajah E; Sethu V; Epps J; Li H; Dang T, 2023, 'DNN controlled adaptive front-end for replay attack detection systems', Speech Communication, 154, http://dx.doi.org/10.1016/j.specom.2023.102973

Journal articles | 2023

Wu J; Dang T; Sethu V; Ambikairajah E, 2023, 'A Novel Markovian Framework for Integrating Absolute and Relative Ordinal Emotion Information', IEEE Transactions on Affective Computing, 14, pp. 2089 - 2101, http://dx.doi.org/10.1109/TAFFC.2022.3159782

Journal articles | 2022

Ramandi HL; Irtza S; Sirojan T; Naman A; Mathew R; Sethu V; Roshan H; Lamei Ramandi H, 2022, 'FracDetect: A novel algorithm for 3D fracture detection in digital fractured rocks', Journal of Hydrology, 607, pp. 127482, http://dx.doi.org/10.1016/j.jhydrol.2022.127482

Journal articles | 2021

Aboutanios E; Sethu V; Ambikairajah E; Taubman DS; Epps J, 2021, 'Teaching Signal Processing through Frequent and Diverse Design: A Pedagogical Approach', IEEE Signal Processing Magazine, 38, pp. 133 - 143, http://dx.doi.org/10.1109/MSP.2021.3057855

Journal articles | 2021

Gunendradasan T; Ambikairajah E; Epps J; Sethu V; Li H, 2021, 'An adaptive transmission line cochlear model based front-end for replay attack detection', Speech Communication, 132, pp. 114 - 122, http://dx.doi.org/10.1016/j.specom.2021.06.004

Journal articles | 2021

Wu J; Dang T; Sethu V; Ambikairajah E, 2021, 'A Novel Markovian Framework for Integrating Absolute and Relative Ordinal Emotion Information.', CoRR, abs/2108.04605

Journal articles | 2021

Wu J; Dang T; Sethu V; Ambikairajah E, 2021, 'Multimodal Affect Models: An Investigation of Relative Salience of Audio and Visual Cues for Emotion Prediction', Frontiers in Computer Science, 3, http://dx.doi.org/10.3389/fcomp.2021.767767

Journal articles | 2020

Cummins N; Sethu V; Epps J; Williamson JR; Quatieri TF; Krajewski J, 2020, 'Generalized two-stage rank regression framework for depression score prediction from speech', IEEE Transactions on Affective Computing, 11, pp. 272 - 283, http://dx.doi.org/10.1109/TAFFC.2017.2766145

Journal articles | 2020

Huang Z; Epps J; Joachim D; Sethu V, 2020, 'Natural Language Processing Methods for Acoustic and Landmark Event-Based Features in Speech-Based Depression Detection', IEEE Journal on Selected Topics in Signal Processing, 14, pp. 435 - 448, http://dx.doi.org/10.1109/JSTSP.2019.2949419

Journal articles | 2020

Suthokumar G; Sriskandaraja K; Sethu V; Ambikairajah E; Li H, 2020, 'An analysis of speaker dependent models in replay detection', Apsipa Transactions on Signal and Information Processing, 9, http://dx.doi.org/10.1017/ATSIP.2020.9

Journal articles | 2019

Brown S; Sethu V; Taubman D, 2019, 'Spatial Wiener filter to reduce spatial aliasing with spherical microphone arrays', Journal of the Acoustical Society of America, 145, pp. 2254 - 2264, http://dx.doi.org/10.1121/1.5096184

Journal articles | 2019

Masood H; Toe CY; Teoh WY; Sethu V; Amal R, 2019, 'Machine Learning for Accelerated Discovery of Solar Photocatalysts', ACS Catalysis, 9, pp. 11774 - 11787, http://dx.doi.org/10.1021/acscatal.9b02531

Journal articles | 2019

Sethu V; Provost EM; Epps J; Busso C; Cummins N; Narayanan SS, 2019, 'The Ambiguous World of Emotion Representation.', CoRR, abs/1909.00360

Journal articles | 2019

Vukovic M; Sethu V; Parker J; Cavedon L; Lech M; Thangarajah J, 2019, 'Estimating cognitive load from speech gathered in a complex real-life training exercise', International Journal of Human Computer Studies, 124, pp. 116 - 133, http://dx.doi.org/10.1016/j.ijhcs.2018.12.003

Journal articles | 2018

Dang T; Sethu V; Ambikairajah E, 2018, 'Compensation Techniques for Speaker Variability in Continuous Emotion Prediction', IEEE Transactions on Affective Computing, pp. 1 - 15, http://dx.doi.org/10.1109/TAFFC.2018.2883044

Journal articles | 2018

Fernando S; Sethu V; Ambikairajah E, 2018, 'Hidden variability subspace learning for adaptation of deep neural networks', Electronics Letters, 54, pp. 173 - 175, http://dx.doi.org/10.1049/el.2017.4027

Journal articles | 2018

Irtza S; Sethu V; Ambikairajah E; Li H, 2018, 'Using language cluster models in hierarchical language identification', Speech Communication, 100, pp. 30 - 40, http://dx.doi.org/10.1016/j.specom.2018.04.004

Journal articles | 2018

Ma J; Sethu V; Ambikairajah E; Lee KA, 2018, 'Generalized variability model for speaker verification', IEEE Signal Processing Letters, 25, pp. 1775 - 1779, http://dx.doi.org/10.1109/LSP.2018.2874814

Journal articles | 2017

Ma J; Sethu V; Ambikairajah E; Lee KA, 2017, 'Duration compensation of i-vectors for short duration speaker verification', Electronics Letters, 53, pp. 405 - 407, http://dx.doi.org/10.1049/el.2016.4629

Journal articles | 2017

Sriskandaraja K; Sethu V; Ambikairajah E; Li H, 2017, 'Front-end for antispoofing countermeasures in speaker verification: Scattering spectral decomposition', IEEE Journal on Selected Topics in Signal Processing, 11, pp. 632 - 643, http://dx.doi.org/10.1109/JSTSP.2016.2647202

Journal articles | 2015

Cummins N; Sethu V; Epps J; Schnieder S; Krajewski J, 2015, 'Analysis of acoustic space variability in speech affected by depression', Speech Communication, 75, pp. 27 - 49, http://dx.doi.org/10.1016/j.specom.2015.09.003

Journal articles | 2015

Thiruvaran T; Sethu V; Ambikairajah E; Li H, 2015, 'Spectral shifting of speaker-specific information for narrow band telephonic speaker recognition', Electronics Letters, http://dx.doi.org/10.1049/el.2015.3117

Journal articles | 2013

Sethu V; Ambikairajah E; Epps J, 2013, 'On the use of speech parameter contours for emotion recognition', Eurasip Journal on Audio Speech and Music Processing, 2013, http://dx.doi.org/10.1186/1687-4722-2013-19

Journal articles | 2011

Ambikairajah E; Li H; Wang L; Yin B; Sethu V, 2011, 'Language Identification: A Tutorial', Circuits and Systems Magazine, IEEE, 11, pp. 82 - 108, http://dx.doi.org/10.1109/MCAS.2011.941081

Journal articles | 2011

Le NP; Ambikairajah E; Epps JR; Sethu V; Choi E, 2011, 'Investigation of spectral centroid features for cognitive load classification', Speech Communication, 53, pp. 540 - 551, http://dx.doi.org/10.1016/j.specom.2011.01.005

Journal articles | 2008

Sethu V; Ambikairajah E; Ge L, 2008, 'Selective weighting of undecimated wavelet coefficients for noise reduction in SAR interferograms', Eurasip Journal on Advances In Signal Processing, pp. 78092 - 78099

Journal articles | 2007

Meng D; Sethu V; Ambikairajah E; Ge L, 2007, 'A novel technique for noise reduction in InSAR images', IEEE Geoscience and Remote Sensing Letters, 4, pp. 226 - 230, http://dx.doi.org/10.1109/LGRS.2006.888845
Working Papers | 2024

Nan Z; Dang T; Sethu V; Ahmed B, 2024, A Joint Spectro-Temporal Relational Thinking Based Acoustic Modeling Framework., http://dx.doi.org

Working Papers | 2023

Dimitriadis A; Pan S; Sethu V; Ahmed B, 2023, Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio., http://dx.doi.org
Conference Papers | 2025

Ambikairajah E; Sirojan T; Sethu V, 2025, 'Tiered Assessment for DSP Education: Exploring Students' Motivation and Performance', in 2025 Asia Pacific Signal and Information Processing Association Annual Summit and Conference Apsipa ASC 2025, pp. 1847 - 1852, http://dx.doi.org/10.1109/APSIPAASC65261.2025.11249035

Conference Papers | 2025

Ambikairajah E; Wu J; Dang T; Sethu V, 2025, 'A Study of Speech Embedding Similarities Between Australian Aboriginal and High-Resource Languages', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 1498 - 1502, http://dx.doi.org/10.21437/Interspeech.2025-911

Conference Papers | 2025

Dang T; Jeyaseelan TM; Ambikairajah E; Sethu V, 2025, 'Characterization of Speech Similarity Between Australian Aboriginal and High-Resource Languages: A Case Study on Dharawal', in 2025 Asia Pacific Signal and Information Processing Association Annual Summit and Conference Apsipa ASC 2025, pp. 658 - 663, http://dx.doi.org/10.1109/APSIPAASC65261.2025.11249066

Preprints | 2025

Dang T; Jeyaseelan TM; Ambikairajah E; Sethu V, 2025, Characterization of Speech Similarity Between Australian Aboriginal and High-Resource Languages: A Case Study on Dharawal, http://dx.doi.org/10.48550/arxiv.2509.01419

Conference Papers | 2025

Hong X; Gong Y; Sethu V; Dang T, 2025, 'AER-LLM: Ambiguity-aware Emotion Recognition Leveraging Large Language Models', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, http://dx.doi.org/10.1109/ICASSP49660.2025.10888198

Conference Papers | 2025

Hong X; Gong Y; Sethu V; Dang T, 2025, 'AER-LLM: Ambiguity-aware Emotion Recognition Leveraging Large Language Models.', in ICASSP, IEEE, pp. 1 - 5, https://doi.org/10.1109/ICASSP49660.2025

Preprints | 2025

Hong X; Gong Y; Sethu V; Dang T, 2025, AER-LLM: Ambiguity-aware Emotion Recognition Leveraging Large Language Models, http://dx.doi.org/10.48550/arxiv.2409.18339

Conference Papers | 2025

Jing M; Sethu V; Ahmed B, 2025, 'Evidential Neural GPLDA: A Novel Approach to Quantify Prediction Uncertainty in Speaker Verification Systems', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, http://dx.doi.org/10.1109/ICASSP49660.2025.10887887

Conference Papers | 2025

Jing M; Sethu V; Ahmed B, 2025, 'Improved Out-of-domain Detection in VAE Latent Spaces with Boundary-driven Regularisation', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, http://dx.doi.org/10.1109/ICASSP49660.2025.10890806

Conference Papers | 2025

Meng H; Breebaart J; Stoddard J; Sethu V; Ambikairajah E, 2025, 'Blind Estimation of Sub-band Acoustic Parameters from Ambisonics Recordings using Spectro-Spatial Covariance Features', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, http://dx.doi.org/10.1109/ICASSP49660.2025.10887842

Conference Papers | 2025

Meng H; Breebaart J; Stoddard J; Sethu V; Ambikairajah E, 2025, 'Blind Estimation of Sub-band Acoustic Parameters from Ambisonics Recordings using Spectro-Spatial Covariance Features.', in ICASSP, IEEE, pp. 1 - 5, https://doi.org/10.1109/ICASSP49660.2025

Preprints | 2025

Meng H; Sethu V; Ambikairajah E; Zhang Q; Li H, 2025, Adaptive Per-Channel Energy Normalization Front-end for Robust Audio Signal Processing, http://dx.doi.org/10.48550/arxiv.2510.18206

Preprints | 2025

Zhang Q; Wickramasinghe B; Ambikairajah E; Sethu V; Li H, 2025, Should Audio Front-ends be Adaptive? Comparing Learnable and Adaptive Front-ends, http://dx.doi.org/10.48550/arxiv.2502.03260

Conference Papers | 2024

Ambikairajah E; Sirojan T; Sethu V; Mishra D, 2024, 'Aligning Tiered Assessments With Course Learning Outcomes', in 2024 IEEE International Conference on Teaching Assessment and Learning for Engineering Tale 2024 Proceedings, http://dx.doi.org/10.1109/TALE62452.2024.10834314

Conference Papers | 2024

Ambikairajah E; Sirojan T; Thiruvaran T; Sethu V, 2024, 'ChatGPT in the Classroom: A Shift in Engineering Design Education', in IEEE Global Engineering Education Conference Educon, http://dx.doi.org/10.1109/EDUCON60312.2024.10578884

Conference Papers | 2024

Ambikairajah E; Thiruvaran T; Sethu V; Mishra D; Sirojan T, 2024, 'A Tiered Learning Framework for Self-Guided Engineering Design Education', in IEEE Global Engineering Education Conference Educon, http://dx.doi.org/10.1109/EDUCON60312.2024.10578840

Conference Papers | 2024

Jing M; Sethu V; Ahmed B, 2024, 'A PROBABILITY GRADIENT BASED APPROACH FOR SAMPLING BOUNDARIES OF IN-DOMAIN DATA', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 5340 - 5344, http://dx.doi.org/10.1109/ICASSP48485.2024.10445872

Preprints | 2024

Meng H; Breebaart J; Stoddard J; Sethu V; Ambikairajah E, 2024, Blind Estimation of Sub-band Acoustic Parameters from Ambisonics Recordings using Spectro-Spatial Covariance Features, http://arxiv.org/abs/2411.03172v2

Preprints | 2024

Meng H; Sethu V; Ambikairajah E, 2024, What is Learnt by the LEArnable Front-end (LEAF)? Adapting Per-Channel Energy Normalisation (PCEN) to Noisy Conditions, http://dx.doi.org/10.21437/Interspeech.2023-1617

Conference Papers | 2024

Meng H; Zhang Q; Zhang X; Sethu V; Ambikairajah E, 2024, 'Binaural Selective Attention Model for Target Speaker Extraction', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 4323 - 4327, http://dx.doi.org/10.21437/Interspeech.2024-683

Conference Papers | 2024

Meng H; Zhang Q; Zhang X; Sethu V; Ambikairajah E, 2024, 'Binaural Selective Attention Model for Target Speaker Extraction.', in Lapidot I; Gannot S (ed.), INTERSPEECH, ISCA, https://doi.org/10.21437/Interspeech.2024

Preprints | 2024

Meng H; Zhang Q; Zhang X; Sethu V; Ambikairajah E, 2024, Binaural Selective Attention Model for Target Speaker Extraction, http://arxiv.org/abs/2406.12236v1

Conference Papers | 2024

Nan Z; Dang T; Sethu V; Ahmed B, 2024, 'Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 6495 - 6499, http://dx.doi.org/10.1109/ICASSP48485.2024.10447530

Conference Papers | 2024

Nan Z; Dang T; Sethu V; Ahmed B, 2024, 'Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling.', in ICASSP, IEEE, pp. 6495 - 6499, https://doi.org/10.1109/ICASSP48485.2024

Preprints | 2024

Nan Z; Dang T; Sethu V; Ahmed B, 2024, A Joint Spectro-Temporal Relational Thinking Based Acoustic Modeling Framework, http://dx.doi.org/10.48550/arxiv.2409.15357

Conference Papers | 2024

Wu J; Dang T; Sethu V; Ambikairajah E, 2024, 'Dual-Constrained Dynamical Neural ODEs for Ambiguity-aware Continuous Emotion Prediction.', in Lapidot I; Gannot S (ed.), INTERSPEECH, ISCA, https://doi.org/10.21437/Interspeech.2024

Conference Papers | 2024

Wu J; Dang T; Sethu V; Ambikairajah E, 2024, 'Emotion Recognition Systems Must Embrace Ambiguity', in Proceedings 2024 12th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos Aciiw 2024, pp. 166 - 170, http://dx.doi.org/10.1109/ACIIW63320.2024.00033

Preprints | 2024

Wu J; Dang T; Sethu V; Ambikairajah E, 2024, Dual-Constrained Dynamical Neural ODEs for Ambiguity-aware Continuous Emotion Prediction, http://dx.doi.org/10.48550/arxiv.2407.21344

Conference Papers | 2024

Wu YT; Wu J; Sethu V; Lee CC, 2024, 'Can Modelling Inter-Rater Ambiguity Lead To Noise-Robust Continuous Emotion Predictions?', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 3714 - 3718, http://dx.doi.org/10.21437/Interspeech.2024-482

Conference Papers | 2023

Dang T; Dimitriadis A; Wu J; Sethu V; Ambikairajah E, 2023, 'Constrained Dynamical Neural ODE for Time Series Modelling: A Case Study on Continuous Emotion Prediction', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, http://dx.doi.org/10.1109/ICASSP49357.2023.10095778

Preprints | 2023

Dimitriadis A; Pan S; Sethu V; Ahmed B, 2023, Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio, http://dx.doi.org/10.48550/arxiv.2310.10922

Conference Papers | 2023

Meng H; Sethu V; Ambikairajah E, 2023, 'What is Learnt by the LEArnable Front-end (LEAF)? Adapting Per-Channel Energy Normalisation (PCEN) to Noisy Conditions', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 2898 - 2902, http://dx.doi.org/10.21437/Interspeech.2023-1617

Conference Papers | 2023

Meng H; Sethu V; Ambikairajah E, 2023, 'What is Learnt by the LEArnable Front-end (LEAF)? Adapting Per-Channel Energy Normalisation (PCEN) to Noisy Conditions.', in Harte N; Carson-Berndsen J; Jones G (eds.), INTERSPEECH, ISCA, pp. 2898 - 2902, https://doi.org/10.21437/Interspeech.2023

Preprints | 2023

Nan Z; Dang T; Sethu V; Ahmed B, 2023, Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling, http://dx.doi.org/10.48550/arxiv.2309.11983

Conference Papers | 2023

Shahin M; Nan Z; Sethu V; Ahmed B, 2023, 'Improving wav2vec2-based Spoken Language Identification by Learning Phonological Features', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 4119 - 4123, http://dx.doi.org/10.21437/Interspeech.2023-2533

Conference Papers | 2023

Wu J; Dang T; Sethu V; Ambikairajah E, 2023, 'Belief Mismatch Coefficient (BMC): A Novel Interpretable Measure of Prediction Accuracy for Ambiguous Emotion States', in 2023 11th International Conference on Affective Computing and Intelligent Interaction Acii 2023, http://dx.doi.org/10.1109/ACII59096.2023.10388210

Conference Papers | 2023

Wu J; Dang T; Sethu V; Ambikairajah E, 2023, 'From Interval to Ordinal: A HMM based Approach for Emotion Label Conversion', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 1843 - 1847, http://dx.doi.org/10.21437/Interspeech.2023-2213

Conference Papers | 2022

Wu J; Dang T; Sethu V; Ambikairajah E, 2022, 'A NOVEL SEQUENTIAL MONTE CARLO FRAMEWORK FOR PREDICTING AMBIGUOUS EMOTION STATES', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 8567 - 8571, http://dx.doi.org/10.1109/ICASSP43922.2022.9746350

Conference Papers | 2021

Ahmed B; Ballard K; Burnham D; Sirojan T; Mehmood H; Estival D; Baker E; Cox F; Arciuli J; Benders T; Demuth K; Kelly B; Diskin-Holdaway C; Shahin M; Sethu V; Epps J; Lee CB; Ambikairajah E, 2021, 'AusKidTalk: An auditory-visual corpus of 3-to 12-year-old Australian children's speech', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 4351 - 4355, http://dx.doi.org/10.21437/Interspeech.2021-2000

Conference Papers | 2021

Ahmed B; Ballard KJ; Burnham D; Sirojan T; Mehmood H; Estival D; Baker E; Cox F; Arciuli J; Benders T; Demuth K; Kelly B; Diskin-Holdaway C; Shahin MA; Sethu V; Epps J; Lee CB; Ambikairajah E, 2021, 'AusKidTalk: An Auditory-Visual Corpus of 3- to 12-Year-Old Australian Children's Speech.', in Hermansky H; Cernocký H; Burget L; Lamel L; Scharenborg O; Motlícek P (eds.), Interspeech, ISCA, pp. 3680 - 3684, https://doi.org/10.21437/Interspeech.2021

Conference Papers | 2021

Bose D; Sethu V; Ambikairajah E, 2021, 'Parametric Distributions to Model Numerical Emotion Labels', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 576 - 580, http://dx.doi.org/10.21437/Interspeech.2021-1000

Conference Papers | 2021

Bose D; Sethu V; Ambikairajah E, 2021, 'Parametric Distributions to Model Numerical Emotion Labels.', in Hermansky H; Cernocký H; Burget L; Lamel L; Scharenborg O; Motlícek P (eds.), Interspeech, ISCA, pp. 4498 - 4502, https://doi.org/10.21437/Interspeech.2021

Preprints | 2021

Dang T; Sethu V; Ambikairajah E; Epps J; Li H, 2021, Joint Spatio-Temporal Discretisation of Nonlinear Active Cochlear Models, http://dx.doi.org/10.48550/arxiv.2108.05993

Preprints | 2021

Wu J; Dang T; Sethu V; Ambikairajah E, 2021, A Novel Markovian Framework for Integrating Absolute and Relative Ordinal Emotion Information, http://dx.doi.org/10.48550/arxiv.2108.04605

Conference Papers | 2020

Ambikairajah E; Sethu V, 2020, 'Cochlear Signal Processing: A Platform for Learning the Fundamentals of Digital Signal Processing', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 9229 - 9233, http://dx.doi.org/10.1109/ICASSP40776.2020.9054297

Conference Papers | 2020

Suthokumar G; Sethu V; Sriskandaraja K; Ambikairajah E, 2020, 'Adversarial Multi-Task Learning for Speaker Normalization in Replay Detection', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 6609 - 6613, http://dx.doi.org/10.1109/ICASSP40776.2020.9054322

Conference Papers | 2019

Atcheson M; Sethu V; Epps J, 2019, 'Using Gaussian Processes with LSTM Neural Networks to Predict Continuous-Time, Dimensional Emotion in Ambiguous Speech', in 2019 8th International Conference on Affective Computing and Intelligent Interaction Acii 2019, http://dx.doi.org/10.1109/ACII.2019.8925450

Conference Papers | 2019

Bose D; Dang T; Sethu V; Ambikairajah E; Fernando S, 2019, 'A Novel Bag-of-Optimised-Clusters Front-End for Speech based Continuous Emotion Prediction', in 2019 8th International Conference on Affective Computing and Intelligent Interaction Acii 2019, http://dx.doi.org/10.1109/ACII.2019.8925490

Conference Papers | 2019

Ouyang A; Dang T; Sethu V; Ambikairajah E, 2019, 'Speech based emotion prediction: Can a linear model work?', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, ISCA, Graz, Austria, pp. 2813 - 2817, presented at INTERSPEECH 2019, Graz, Austria, 15 September 2019 - 19 September 2019, http://dx.doi.org/10.21437/Interspeech.2019-3149

Preprints | 2019

Sethu V; Provost EM; Epps J; Busso C; Cummins N; Narayanan S, 2019, The Ambiguous World of Emotion Representation, http://dx.doi.org/10.48550/arxiv.1909.00360

Conference Papers | 2019

Suthokumar G; Sriskandaraja K; Sethu V; Wijenayake C; Ambikairajah E, 2019, 'Phoneme Specific Modelling and Scoring Techniques for Anti Spoofing System', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 6106 - 6110, http://dx.doi.org/10.1109/ICASSP.2019.8682411

Conference Papers | 2019

Wickramasinghe B; Ambikairajah E; Epps J; Sethu V; Li H, 2019, 'Auditory Inspired Spatial Differentiation for Replay Spoofing Attack Detection', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 6011 - 6015, http://dx.doi.org/10.1109/ICASSP.2019.8683693

Conference Papers | 2018

Atcheson M; Sethu V; Epps J, 2018, 'Demonstrating and modelling systematic time-varying annotator disagreement in continuous emotion annotation', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 3668 - 3672, http://dx.doi.org/10.21437/Interspeech.2018-1933

Conference Papers | 2018

Dang T; Sethu V; Ambikairajah E, 2018, 'Dynamic Multi-Rater Gaussian Mixture Regression Incorporating Temporal Dependencies of Emotion Uncertainty Using Kalman Filters', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 4929 - 4933, http://dx.doi.org/10.1109/ICASSP.2018.8461321

Conference Papers | 2018

Fernando S; Irtza S; Sethu V; Ambikairajah E, 2018, 'Advances in Feature Extraction and Modelling for Short Duration Language Identification', in 2018 IEEE 9th International Conference on Information and Automation for Sustainability Iciafs 2018, http://dx.doi.org/10.1109/ICIAFS.2018.8913386

Conference Papers | 2018

Fernando S; Sethu V; Ambikairajah E; Li H, 2018, 'Second Order Factorized Model Adaptation for Short Duration Language Identification', in 2018 Asia Pacific Signal and Information Processing Association Annual Summit and Conference Apsipa ASC 2018 Proceedings, pp. 1440 - 1447, http://dx.doi.org/10.23919/APSIPA.2018.8659586

Conference Papers | 2018

Fernando S; Sethu V; Ambikairajah E, 2018, 'Factorized Hidden Variability Learning for Adaptation of Short Duration Language Identification Models', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 5204 - 5208, http://dx.doi.org/10.1109/ICASSP.2018.8462094

Conference Papers | 2018

Fernando S; Sethu V; Ambikairajah E, 2018, 'Sub-band envelope features using frequency domain linear prediction for short duration language identification', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 1818 - 1822, http://dx.doi.org/10.21437/Interspeech.2018-1805

Conference Papers | 2018

Gamage KW; Dang T; Sethu V; Epps J; Ambikairajah E, 2018, 'Speech-based Continuous Emotion Prediction by Learning Perception Responses related to Salient Events: A Study based on Vocal Affect Bursts and Cross-Cultural Affect in AVEC 2018', in Avec 2018 Proceedings of the 2018 Audio Visual Emotion Challenge and Workshop Co Located with mm 2018, pp. 47 - 55, http://dx.doi.org/10.1145/3266302.3266314

Conference Papers | 2018

Irtza S; Sethu V; Ambikairajah E; Li H, 2018, 'End-to-End Hierarchical Language Identification System', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 5199 - 5203, http://dx.doi.org/10.1109/ICASSP.2018.8461419

Conference Papers | 2018

Ma J; Sethu V; Ambikairajah E; Lee KA, 2018, 'Speaker-Phonetic Vector Estimation for Short Duration Speaker Verification', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 5264 - 5268, http://dx.doi.org/10.1109/ICASSP.2018.8461978

Conference Papers | 2018

Sriskandaraja K; Sethu V; Ambikairajah E, 2018, 'Deep Siamese architecture based replay detection for secure voice biometric', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 671 - 675, http://dx.doi.org/10.21437/Interspeech.2018-1819

Conference Papers | 2018

Suthokumar G; Sethu V; Wijenayake C; Ambikairajah E, 2018, 'Modulation dynamic features for the detection of replay attacks', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 691 - 695, http://dx.doi.org/10.21437/Interspeech.2018-1846

Conference Papers | 2018

Suthokumar G; Sriskandaraja K; Sethu V; Wijenayake C; Ambikairajah E; Li H, 2018, 'Use of Claimed Speaker Models for Replay Detection', in 2018 Asia Pacific Signal and Information Processing Association Annual Summit and Conference Apsipa ASC 2018 Proceedings, pp. 1038 - 1046, http://dx.doi.org/10.23919/APSIPA.2018.8659510

Conference Papers | 2018

Suthokumar G; Sriskandaraja K; Sethu V; Wijenayake C; Ambikairajah E, 2018, 'An Investigation about the Scalability of the Spoofing Detection System', in 2018 IEEE 9th International Conference on Information and Automation for Sustainability Iciafs 2018, http://dx.doi.org/10.1109/ICIAFS.2018.8913369

Conference Papers | 2017

Atcheson M; Sethu V; Epps J, 2017, 'Gaussian Process Regression for Continuous Emotion Recognition with Global Temporal Invariance.', in Lawrence N; Reid M (ed.), AffComp@IJCAI, PMLR, pp. 34 - 44, http://proceedings.mlr.press/v66/

Conference Papers | 2017

Cetin E; Abewardana Wijenayake C; Sethu V; Ambikairajah E, 2017, 'A Flipped Mode Approach to Teaching an Electronic System Design Course', in PROCEEDINGS OF 2017 IEEE 6TH INTERNATIONAL CONFERENCE ON TEACHING, ASSESSMENT, AND LEARNING FOR ENGINEERING (TALE), IEEE, Hong Kong, pp. 223 - 228, presented at IEEE International Conference on Teaching, Assessment, and Learning for Engineering, Hong Kong, 12 December 2017 - 14 December 2017, http://dx.doi.org/10.1109/TALE.2017.8252337

Conference Papers | 2017

Dang T; Atcheson M; Stasak B; Hayat M; Goecke R; Huang Z; Le P; Epps J; Jayawardena S; Sethu V, 2017, 'Investigating word affect features and fusion of probabilistic predictions incorporating uncertainty in AVEC 2017', in Ringeval F; Schuller BW; Valstar MF; Gratch J; Cowie R; Pantic M (eds.), AVEC 2017 - Proceedings of the 7th Annual Workshop on Audio/Visual Emotion Challenge, co-located with MM 2017, Association for Computing Machinery (ACM), Mountain View, California, USA, pp. 27 - 35, presented at 7th Annual Workshop on Audio/Visual Emotion Challenge, Mountain View, California, USA, 23 October 2017 - 23 October 2017, http://dx.doi.org/10.1145/3133944.3133952

Conference Papers | 2017

Dang T; Sethu V; Epps J; Ambikairajah E, 2017, 'An investigation of emotion prediction uncertainty using Gaussian Mixture Regression', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 1248 - 1252, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-512

Conference Papers | 2017

Fernando S; Sethu V; Ambikairajah E; Epps J, 2017, 'Bidirectional modelling for short duration language identification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 2809 - 2813, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-286

Conference Papers | 2017

Gamage KW; Sethu V; Ambikairajah E, 2017, 'Modeling variable length phoneme sequences - A step towards linguistic information for speech emotion recognition in wider world', in 2017 7th International Conference on Affective Computing and Intelligent Interaction Acii 2017, pp. 518 - 523, http://dx.doi.org/10.1109/ACII.2017.8273648

Conference Papers | 2017

Gamage KW; Sethu V; Ambikairajah E, 2017, 'Salience based lexical features for emotion recognition', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 5830 - 5834, http://dx.doi.org/10.1109/ICASSP.2017.7953274

Conference Papers | 2017

Irtza S; Sethu V; Ambikairajah E; Li H, 2017, 'Investigating scalability in hierarchical language identification system', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 2581 - 2585, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-596

Conference Papers | 2017

Lee KA; Hautamäki V; Kinnunen T; Larcher A; Zhang C; Nautsch A; Stafylakis T; Liu G; Rouvier M; Rao W; Alegre F; Ma J; Mak MW; Sarkar AK; Delgado H; Saeidi R; Aronowitz H; Sizov A; Sun H; Nguyen TH; Wang G; Ma B; Vestman V; Sahidullah M; Halonen M; Kanervisto A; Le Lan G; Bahmaninezhad F; Isadskiy S; Rathgeb C; Busch C; Tzimiropoulos G; Qian Q; Wang Z; Zhao Q; Wang T; Li H; Xue J; Zhu S; Jin R; Zhao T; Bousquet PM; Ajili M; Kheder WB; Matrouf D; Lim ZH; Xu C; Xu H; Xiao X; Chng ES; Fauve B; Sriskandaraja K; Sethu V; Lin WW; Thomsen DAL; Tan ZH; Todisco M; Evans N; Li H; Hansen JHL; Bonastre JF; Ambikairajah E, 2017, 'The I4U mega fusion and collaboration for NIST speaker recognition evaluation 2016', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 1328 - 1332, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-203

Conference Papers | 2017

Ma J; Sethu V; Ambikairajah E; Lee KA, 2017, 'Incorporating local acoustic variability information into short duration speaker verification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 1502 - 1506, presented at Interspeech 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-266

Conference Papers | 2017

Sriskandaraja K; Suthokumar G; Sethu V; Ambikairajah E, 2017, 'Investigating the use of scattering coefficients for replay attack detection', in Proceedings 9th Asia Pacific Signal and Information Processing Association Annual Summit and Conference Apsipa ASC 2017, pp. 1195 - 1198, http://dx.doi.org/10.1109/APSIPA.2017.8282211

Conference Papers | 2017

Suthokumar G; Sriskandaraja K; Sethu V; Wijenayake C; Ambikairajah E, 2017, 'Independent modelling of high and low energy speech frames for spoofing detection', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 2606 - 2610, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-836

Conference Papers | 2016

Dang T; Sethu V; Ambikairajah E, 2016, 'Factor analysis based speaker normalisation for continuous emotion prediction', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 913 - 917, http://dx.doi.org/10.21437/interspeech.2016-880

Conference Papers | 2016

Fernando S; Sethu V; Ambikairajah E, 2016, 'A feature normalisation technique for PLLR based language identification systems', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, San Francisco, CA, USA, pp. 2925 - 2929, presented at Interspeech 2016, San Francisco, CA, USA, 08 September 2016 - 12 September 2016, http://dx.doi.org/10.21437/Interspeech.2016-560

Conference Papers | 2016

Huang Z; Stasak B; Dang T; Gamage KW; Le P; Sethu V; Epps J, 2016, 'Staircase regression in OA RVM, data selection and gender dependency in AVEC 2016', in AVEC 2016 - Proceedings of the 6th International Workshop on Audio/Visual Emotion Challenge, co-located with ACM Multimedia 2016, ASSOC COMPUTING MACHINERY, Amsterdam, NETHERLANDS, pp. 19 - 26, presented at 6th International Workshop on Audio-Visual Emotion Recognition Challenge - Depression, Mood, and Emotion (AVEC), Amsterdam, NETHERLANDS, 16 October 2016 - 16 October 2016, http://dx.doi.org/10.1145/2988257.2988265

Conference Papers | 2016

Irtza S; Sethu V; Bavattichalil H; Ambikairajah E; Li H, 2016, 'A hierarchical framework for language identification', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Shanghai, China, pp. 5820 - 5824, presented at 2016 IEEE International Conference on, Shanghai, China, 20 March 2016 - 25 March 2016, http://dx.doi.org/10.1109/ICASSP.2016.7472793

Conference Papers | 2016

Irtza S; Sethu V; Fernando S; Ambikairajah E; Li H, 2016, 'Out of set language modelling in Hierarchical language identification', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 3270 - 3274, http://dx.doi.org/10.21437/interspeech.2016-558

Conference Papers | 2016

Ma J; Irtza S; Sriskandaraja K; Sethu V; Ambikairajah E, 2016, 'Parallel speaker and content modelling for text-dependent speaker verification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, San Francisco, USA, pp. 435 - 439, presented at Interspeech 2016, San Francisco, USA, 08 September 2016 - 12 September 2016, http://dx.doi.org/10.21437/Interspeech.2016-825

Conference Papers | 2016

Ma J; Sethu V; Ambikairajah E; Lee KA, 2016, 'Twin model G-PLDA for duration mismatch compensation in text-independent speaker verification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, San Francisco, USA, pp. 1853 - 1857, presented at Interspeech 2016, San Francisco, USA, 08 September 2016 - 12 September 2016, http://dx.doi.org/10.21437/Interspeech.2016-683

Conference Papers | 2016

Sethu V; Fernando S; Ambikairajah E, 2016, 'Eigenfeatures: An alternative to Shifted Delta Coefficients for Language Identification', in SST2016, ASSTA, Parramatta, Australia, pp. 253 - 256, presented at 16th Speech Science and Technology Conference (SST2016), Parramatta, Australia, 06 December 2016 - 09 December 2017, https://www.researchgate.net/publication/311615271_Eigenfeatures_An_alternative_to_Shifted_Delta_Coefficients_for_Language_Identification

Conference Papers | 2016

Sriskandaraja K; Sethu V; Le PN; Ambikairajah E, 2016, 'Investigation of sub-band discriminative information between spoofed and genuine speech', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, San Francisco, USA, pp. 1710 - 1714, presented at Interspeech 2016, San Francisco, USA, 08 September 2016 - 12 September 2016, http://dx.doi.org/10.21437/Interspeech.2016-844

Conference Papers | 2015

Cummins N; Epps J; Sethu V; Krajewski J, 2015, 'Weighted pairwise Gaussian likelihood regression for depression score prediction', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 4779 - 4783, http://dx.doi.org/10.1109/ICASSP.2015.7178878

Conference Papers | 2015

Cummins N; Sethu V; Epps J; Krajewski J, 2015, 'Relevance Vector Machine for Depression Prediction', in Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, presented at Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 06 September 2015 - 10 September 2015, http://www.isca-speech.org/archive/interspeech_2015/i15_0110.html

Conference Papers | 2015

Epps J; Sethu V; Eaton R; Ambikairajah E, 2015, 'High Definition Multi-View Video Guidance for Self-Directed Learning and More Effective Engineering Laboratories', Geelong,Australia, presented at Australasian Association for Engineering Education, Geelong,Australia, 06 December 2015 - 09 December 2015, https://aaee2015conference.sched.org/event/5aaZ/4b-high-definition-multi-view-video-guidance-for-self-directed-learning-and-more-effective-engineering-laboratories

Conference Papers | 2015

Gamage KW; Sethu V; Le P; Ambikairajah E, 2015, 'An i-vector GPLDA System for Speech based Emotion Recognition', in 2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), Hong Kong, presented at The proceedings of 7th Asia-Pacific Signal and Information Processing Association Conference (APSIPA), Hong Kong, 16 December 2015 - 19 December 2015, http://dx.doi.org/10.1109/APSIPA.2015.7415522

Conference Papers | 2015

Hines C; Sethu V; Epps J, 2015, 'Twitter: A new online source of automatically tagged data for conversational speech emotion recognition', in ASM 2015 Proceedings of the 1st International Workshop on Affect and Sentiment in Multimedia Co Located with ACM mm 2015, pp. 9 - 14, http://dx.doi.org/10.1145/2813524.2813529

Conference Papers | 2015

Huang Z; Dang T; Cummins N; Stasak B; Le P; Sethu V; Epps J, 2015, 'An investigation of annotation delay compensation and output-associative fusion for multimodal continuous emotion prediction', in Avec 2015 Proceedings of the 5th International Workshop on Audio Visual Emotion Challenge Co Located with mm 2015, pp. 41 - 48, http://dx.doi.org/10.1145/2808196.2811640

Conference Papers | 2015

Irtza S; Bavattichalil H; Sethu V; Ambikairajah E, 2015, 'Scalable I-vector Concatenation for PLDA based Language Identification System', in The proceedings of 7th Asia-Pacific Signal and Information Processing Association Conference (APSIPA), Hong Kong, presented at The proceedings of 7th Asia-Pacific Signal and Information Processing Association Conference (APSIPA), Hong Kong, 16 December 2015 - 19 December 2015, http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=7415458

Conference Papers | 2015

Irtza S; Sethu V; Le P; Ambikairajah E; Li H, 2015, 'Phonemes Frequency Based PLLR Dimensionality Reduction for Language Recognition', Dresden, Germany, presented at In Sixteenth Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 06 September 2015 - 10 September 2015

Conference Papers | 2015

Khlif A; Sethu V, 2015, 'An iterative multi range non-negative matrix factorization algorithm for polyphonic music transcription', in Proceedings of the 16th International Society for Music Information Retrieval Conference Ismir 2015, pp. 330 - 335

Conference Papers | 2015

Sriskandaraja K; Sethu V; Le P; Ambikairajah E, 2015, 'A Model Based Voice Activity Detector for Noisy Environments', Dresden, Germany, presented at Sixteenth Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 06 September 2015 - 10 September 2015, http://www.isca-speech.org/archive/interspeech_2015/i15_2297.html

Conference Papers | 2014

Cummins N; Epps J; Sethu V; Krajewski J, 2014, 'Variability compensation in small data: Oversampled extraction of i-vectors for the classification of depressed speech', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 970 - 974, http://dx.doi.org/10.1109/ICASSP.2014.6853741

Conference Papers | 2014

Cummins N; Sethu V; Epps J; Krajewski J, 2014, 'Probabilistic acoustic volume analysis for speech affected by depression', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 1238 - 1242

Conference Papers | 2014

Kua JMK; Sethu V; Le P; Ambikairajah E, 2014, 'The UNSW submission to INTERSPEECH 2014 ComParE cognitive load challenge', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 746 - 750

Conference Papers | 2013

Cummins N; Epps J; Sethu V; Breakspear M; Goecke R, 2013, 'Modeling Spectral Variability for the Classification of Depressed Speech', in INTERSPEECH 2013, 14thAnnual Conference of the International Speech Communication Association, Lyon, France, presented at 14th Annual Conference of the International Speech Communication Association Interspeech2013, Lyon, France, 25 August 2013 - 29 August 2013

Conference Papers | 2013

Cummins N; Joshi J; Dhall A; Sethu V; Goecke R; Epps J, 2013, 'Diagnosis of depression by behavioural signals: A multimodal approach', in Avec 2013 Proceedings of the 3rd ACM International Workshop on Audio Visual Emotion Challenge, pp. 11 - 20, http://dx.doi.org/10.1145/2512530.2512535

Conference Papers | 2013

Sethu V; Epps J; Ambikairajah E, 2013, 'GMM Based Speaker Variability Compensated System for Interspeech 2013 ComParE Emotion Challenge', in CERISARA C (ed.), INTERSPEECH 2013, 14thAnnual Conference of the International Speech Communication Association, Lyon, France, presented at INTERSPEECH 2013 14thAnnual Conference of the International Speech Communication Association, Lyon, France, 25 August 2013 - 29 August 2013

Conference Papers | 2013

Sethu V; Epps J; Ambikairajah E, 2013, 'Speaker variability in speech based emotion models - Analysis and normalisation', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 7522 - 7525, http://dx.doi.org/10.1109/ICASSP.2013.6639125

Conference Papers | 2012

Ambikairajah E; Kua JM; Sethu V; Li H, 2012, 'PNCC-ivector-SRC based Speaker Verification', in 2012 Conference Handbook - Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2012, APSIPA, Hollywood, California, USA, presented at Asia Pacific Signal and Information Processing Association, Hollywood, California, USA, 03 December 2012 - 06 December 2012

Conference Papers | 2012

Ding N; Sethu V; Epps JR; Ambikairajah E, 2012, 'Speaker variability in emotion recognition - An adaptation based approach', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Institute of Electrical and Electronics Engineers Inc., Piscataway, NJ, pp. 5101 - 5104, presented at 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012, Kyoto, Japan, 25 March 2012 - 30 March 2012, http://dx.doi.org/10.1109/ICASSP.2012.6289068

Conference Papers | 2011

Le PN; Sethu V; Ambikairajah E; Kua JMK, 2011, 'Investigation of the robustness of a non-uniform filterbank for cognitive load classification', in Icics 2011 8th International Conference on Information Communications and Signal Processing, http://dx.doi.org/10.1109/ICICS.2011.6174268

Conference Papers | 2010

Ambikairajah E; Ibrahim RK; Sethu V, 2010, 'Novel delta zero crossing regression features for gait pattern classification', IEEE, Beunos Aires, presented at Proceedings of the 32nd Annual International Conference of the IEEE EMBS, Beunos Aires, 31 August 2010 - 04 September 2010

Conference Papers | 2010

Le NP; Epps JR; Ambikairajah E; Sethu V, 2010, 'Robust Speech-Based Cognitive Load Classification Using a Multi-band Approach', in The Proceedings of APSIPA ASC 2010, Asia-Pacific Signal Processing Association, Hong Kong, presented at Asia-Pacific Signal Processing Association Conf., Singapore, 14 December 2010 - 17 December 2010

Conference Papers | 2009

Sethu V; Ambikairajah E; Epps JR, 2009, 'Pitch Contour Prameterisation based on Linear Stylisation for Emotion Recognition', in Interspeech 2012, Curran Associates, Inc, Brighton, UK, presented at Interspeech 2009 Speech and Intelligence, Brighton, UK, 06 September 2009 - 10 September 2009

Conference Papers | 2009

Sethu V; Ambikairajah E; Epps JR, 2009, 'SPEAKER DEPENDENCY OF SPECTRAL FEATURES AND SPEECH PRODUCTION CUES FOR AUTOMATIC EMOTION CLASSIFICATION', in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, IEEE, USA, presented at ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, Taipei, Taiwan, 19 April 2009 - 24 April 2009

Conference Papers | 2009

Sethu V; Ambikairajah E; Epps J, 2009, 'Pitch contour parameterisation based on linear stylisation for emotion recognition', in Interspeech 2009, ISCA, presented at Interspeech 2009, http://dx.doi.org/10.21437/interspeech.2009-579

Conference Papers | 2008

Le NP; Ambikairajah E; Sethu V, 2008, 'Speech enhancement based on empirical mode decomposition', in Modelling, Identification and Control 2008, Innsbruck, Austria, presented at 5th IASTED International Conference on Signal Processing, Pattern Recognition and Applications 2008, Innsbruck, Austria, 13 February 2008 - 15 February 2008

Conference Papers | 2008

Sethu V; Ambikairajah E; Epps JR, 2008, 'Empirical mode decomposition based weighted frequency feature for speech-based emotion classification', in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, IEEE, USA, presented at IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2008), 31 March 2008 - 04 April 2008

Conference Papers | 2008

Sethu V; Ambikairajah E; Epps JR, 2008, 'Phonetic and speaker variations in automatic emotion classification', in Interspeech 2012, Curran Associates, Inc, Brisbane Australia, presented at Interspeech 2008, Brisbane Australia, 22 September 2008 - 26 September 2008

Conference Papers | 2007

Sethu V; Ambikairajah E; Epps JR, 2007, 'Group Delay Features for Emotion Detection', in INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECHCOMMUNICATION ASSOCIATION, VOLS 1-4, Isca-Inst Speech Communication Assoc, Baixas

Conference Papers | 2007

Sethu V; Ambikairajah E; Epps JR, 2007, 'Speaker normalisation for speech-based emotion detection', in 2007 15th International Conference on Digital Signal Processing, Wales, UK, presented at 15th International Conference on Digital Signal Processing 2007, Wales, UK, 01 July 2007 - 04 July 2007

Conference Papers | 2007

Wang Y; An J; Sethu V; Ambikairajah E, 2007, 'Perceptually motivated pre-filter for speech enhancement using Kalman filtering', in 2007 6th International Conference on Information Communications and Signal Processing Icics, http://dx.doi.org/10.1109/ICICS.2007.4449758

Conference Papers | 2007

Sethu V; Ambikairajah E; Epps J, 2007, 'Group delay features for emotion detection', in Interspeech 2007, ISCA, presented at Interspeech 2007, http://dx.doi.org/10.21437/interspeech.2007-617

Conference Papers | 2006

Ambikairajah E; Sethu V; Ge L, 2006, 'Noise reduction in SAR interferograms using undecimated wavelet transform', in 2nd international symposium on Geo-information for Disaster Management, Goa, India, presented at 2nd international symposium on Geo-information for Disaster Management, Goa, India, 25 September 2006 - 26 September 2006

ARC Discovery Project (2020)
ARC Discovery Project (2019)
ARC LIEF Grant (2019)
UNSW Research Infrastructure (2019)
UNSW Faculty of Engineering Research Infrastructure (2018)
Huawei Innovation Research Program (2018)
UNSW SEIF Grant (2018)
ARC Linkage (2017)
UNSW Faculty of Engineering Silverstar (2016)
UNSW Strategic Educational Development Grant (2014)
NICTA International Postgraduate Award (2006-2009)

Research Interests include:

Artificial Emotional Intelligence and Speech based Emotion Recognition
Computational models of cochlear signal processing
Speaker recognition/Voice biometrics
Application of machine learning to signal processing tasks

My Teaching

I currently teach or have previously taught the following courses at UNSW:

Data Science for Electrical Engineers (ELEC9741)
Speech Processing (ELEC9723)
Digital Signal Processing (ELEC3104)
Electrical Systems Design (ELEC2117)
Design Proficiency (ELEC/TELE/PHTN4123)

Follow

Associate Professor Vidhyasaharan Sethu