Associate Professor Vidhyasaharan Sethu

Associate Professor Vidhyasaharan Sethu

Associate Professor

PhD (UNSW), MEngSc in Signal Processing (UNSW), BE in Electronics and Communication Engineering (Anna University)

Engineering
Electrical Engineering and Telecommunications

Vidhyasaharan Sethu is an Associate Professor with the School of Electrical Engineering and Telecommunications. His primary research interests are in the field of speech signal processing. Particularly in the application of machine learning techniques for addressing speech processing tasks. His research interests include speech based emotion and mental state recognition systems, affective computing, voice biometrics and more broadly the overlap between machine learning and signal processing.

    Phone
    +61 2 9385 7737
    Location
    Room 442, EE&T Building (G17), UNSW Sydney
    • Book Chapters | 2015
      Sethu V; Epps J; Ambikairajah E, 2015, 'Speech based emotion recognition', in Speech and Audio Processing for Coding, Enhancement and Recognition, Springer Link, pp. 197 - 228, http://dx.doi.org/10.1007/978-1-4939-1456-2_7
      Book Chapters | 2014
      Ambikairajah E; Sethu V; Eaton R; Sheng M, 2014, 'Evolving use of educational technologies: Enhancing lectures', in Using Technology Tools to Innovate Assessment Reporting and Teaching Practices in Engineering Education, pp. 241 - 258, http://dx.doi.org/10.4018/978-1-4666-5011-4.ch018
    • Journal articles | 2025
      Charls D; Sethu V; Ahmed B, 2025, 'Uncertainty-Aware Domain Adaptation for ECG Classification', Annual International Conference of the IEEE Engineering in Medicine and Biology Society IEEE Engineering in Medicine and Biology Society Annual International Conference, 2025, pp. 1 - 6, http://dx.doi.org/10.1109/EMBC58623.2025.11254147
      Journal articles | 2025
      Jing M; Sethu V; Ahmed B; Lee KA, 2025, 'Quantifying prediction uncertainties in automatic speaker verification systems', Computer Speech and Language, 94, http://dx.doi.org/10.1016/j.csl.2025.101806
      Journal articles | 2025
      Wu J; Dang T; Sethu V; Ambikairajah E, 2025, 'How many raters do we need? Analyses of uncertainty in estimating ambiguity-aware emotion labels', IEEE Transactions on Affective Computing, http://dx.doi.org/10.1109/TAFFC.2025.3616071
      Journal articles | 2025
      Zhang Q; Wickramasinghe B; Ambikairajah E; Sethu V; Li H, 2025, 'Should Audio Front-Ends be Adaptive? Comparing Learnable and Adaptive Front-Ends', IEEE TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 33, pp. 998 - 1010, http://dx.doi.org/10.1109/TASLPRO.2025.3542281
      Journal articles | 2024
      Bose D; Sethu V; Ambikairajah E, 2024, 'Continuous Emotion Ambiguity Prediction: Modeling with Beta Distributions', IEEE Transactions on Affective Computing, 15, pp. 1684 - 1695, http://dx.doi.org/10.1109/TAFFC.2024.3367371
      Journal articles | 2024
      Haghshenas Y; Wong WP; Gunawan D; Khataee A; Keyikoğlu R; Razmjou A; Kumar PV; Toe CY; Masood H; Amal R; Sethu V; Teoh WY, 2024, 'Predicting the rates of photocatalytic hydrogen evolution over cocatalyst-deposited TiO2 using machine learning with active photon flux as a unifying feature', Ees Catalysis, 2, pp. 612 - 623, http://dx.doi.org/10.1039/d3ey00246b
      Journal articles | 2024
      Haghshenas Y; Wong WP; Sethu V; Amal R; Kumar PV; Teoh WY, 2024, 'Full prediction of band potentials in semiconductor materials', Materials Today Physics, 46, pp. 101519, http://dx.doi.org/10.1016/j.mtphys.2024.101519
      Journal articles | 2024
      Hong X; Gong Y; Sethu V; Dang T, 2024, 'AER-LLM: Ambiguity-aware Emotion Recognition Leveraging Large Language Models.', CoRR, abs/2409.18339
      Journal articles | 2024
      Meng H; Zhang Q; Zhang X; Sethu V; Ambikairajah E, 2024, 'Binaural Selective Attention Model for Target Speaker Extraction', Interspeech 2024, pp. 4323 - 4327, http://dx.doi.org/10.21437/interspeech.2024-683
      Journal articles | 2024
      Nan Z; Dang T; Sethu V; Ahmed B, 2024, 'A Joint Spectro-Temporal Relational Thinking Based Acoustic Modeling Framework.', CoRR, abs/2409.15357
      Journal articles | 2024
      Nan Z; Dang T; Sethu V; Ahmed B, 2024, 'Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling', ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pp. 6495 - 6499, http://dx.doi.org/10.1109/icassp48485.2024.10447530
      Journal articles | 2024
      Wu J; Dang T; Sethu V; Ambikairajah E, 2024, 'Dual-Constrained Dynamical Neural ODEs for Ambiguity-aware Continuous Emotion Prediction', Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 3185 - 3189, http://dx.doi.org/10.21437/Interspeech.2024-119
      Journal articles | 2023
      Dimitriadis A; Pan S; Sethu V; Ahmed B, 2023, 'Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio.', CoRR, abs/2310.10922
      Journal articles | 2023
      Masood H; Sirojan T; Toe CY; Kumar PV; Haghshenas Y; Sit PHL; Amal R; Sethu V; Teoh WY, 2023, 'Enhancing prediction accuracy of physical band gaps in semiconductor materials', Cell Reports Physical Science, 4, http://dx.doi.org/10.1016/j.xcrp.2023.101555
      Journal articles | 2023
      Wickramasinghe B; Ambikairajah E; Sethu V; Epps J; Li H; Dang T, 2023, 'DNN controlled adaptive front-end for replay attack detection systems', Speech Communication, 154, http://dx.doi.org/10.1016/j.specom.2023.102973
      Journal articles | 2023
      Wu J; Dang T; Sethu V; Ambikairajah E, 2023, 'A Novel Markovian Framework for Integrating Absolute and Relative Ordinal Emotion Information', IEEE Transactions on Affective Computing, 14, pp. 2089 - 2101, http://dx.doi.org/10.1109/TAFFC.2022.3159782
      Journal articles | 2022
      Ramandi HL; Irtza S; Sirojan T; Naman A; Mathew R; Sethu V; Roshan H; Lamei Ramandi H, 2022, 'FracDetect: A novel algorithm for 3D fracture detection in digital fractured rocks', Journal of Hydrology, 607, pp. 127482, http://dx.doi.org/10.1016/j.jhydrol.2022.127482
      Journal articles | 2021
      Aboutanios E; Sethu V; Ambikairajah E; Taubman DS; Epps J, 2021, 'Teaching Signal Processing through Frequent and Diverse Design: A Pedagogical Approach', IEEE Signal Processing Magazine, 38, pp. 133 - 143, http://dx.doi.org/10.1109/MSP.2021.3057855
      Journal articles | 2021
      Gunendradasan T; Ambikairajah E; Epps J; Sethu V; Li H, 2021, 'An adaptive transmission line cochlear model based front-end for replay attack detection', Speech Communication, 132, pp. 114 - 122, http://dx.doi.org/10.1016/j.specom.2021.06.004
      Journal articles | 2021
      Wu J; Dang T; Sethu V; Ambikairajah E, 2021, 'A Novel Markovian Framework for Integrating Absolute and Relative Ordinal Emotion Information.', CoRR, abs/2108.04605
      Journal articles | 2021
      Wu J; Dang T; Sethu V; Ambikairajah E, 2021, 'Multimodal Affect Models: An Investigation of Relative Salience of Audio and Visual Cues for Emotion Prediction', Frontiers in Computer Science, 3, http://dx.doi.org/10.3389/fcomp.2021.767767
      Journal articles | 2020
      Cummins N; Sethu V; Epps J; Williamson JR; Quatieri TF; Krajewski J, 2020, 'Generalized two-stage rank regression framework for depression score prediction from speech', IEEE Transactions on Affective Computing, 11, pp. 272 - 283, http://dx.doi.org/10.1109/TAFFC.2017.2766145
      Journal articles | 2020
      Huang Z; Epps J; Joachim D; Sethu V, 2020, 'Natural Language Processing Methods for Acoustic and Landmark Event-Based Features in Speech-Based Depression Detection', IEEE Journal on Selected Topics in Signal Processing, 14, pp. 435 - 448, http://dx.doi.org/10.1109/JSTSP.2019.2949419
      Journal articles | 2020
      Suthokumar G; Sriskandaraja K; Sethu V; Ambikairajah E; Li H, 2020, 'An analysis of speaker dependent models in replay detection', Apsipa Transactions on Signal and Information Processing, 9, http://dx.doi.org/10.1017/ATSIP.2020.9
      Journal articles | 2019
      Brown S; Sethu V; Taubman D, 2019, 'Spatial Wiener filter to reduce spatial aliasing with spherical microphone arrays', Journal of the Acoustical Society of America, 145, pp. 2254 - 2264, http://dx.doi.org/10.1121/1.5096184
      Journal articles | 2019
      Masood H; Toe CY; Teoh WY; Sethu V; Amal R, 2019, 'Machine Learning for Accelerated Discovery of Solar Photocatalysts', ACS Catalysis, 9, pp. 11774 - 11787, http://dx.doi.org/10.1021/acscatal.9b02531
      Journal articles | 2019
      Sethu V; Provost EM; Epps J; Busso C; Cummins N; Narayanan SS, 2019, 'The Ambiguous World of Emotion Representation.', CoRR, abs/1909.00360
      Journal articles | 2019
      Vukovic M; Sethu V; Parker J; Cavedon L; Lech M; Thangarajah J, 2019, 'Estimating cognitive load from speech gathered in a complex real-life training exercise', International Journal of Human Computer Studies, 124, pp. 116 - 133, http://dx.doi.org/10.1016/j.ijhcs.2018.12.003
      Journal articles | 2018
      Dang T; Sethu V; Ambikairajah E, 2018, 'Compensation Techniques for Speaker Variability in Continuous Emotion Prediction', IEEE Transactions on Affective Computing, pp. 1 - 15, http://dx.doi.org/10.1109/TAFFC.2018.2883044
      Journal articles | 2018
      Fernando S; Sethu V; Ambikairajah E, 2018, 'Hidden variability subspace learning for adaptation of deep neural networks', Electronics Letters, 54, pp. 173 - 175, http://dx.doi.org/10.1049/el.2017.4027
      Journal articles | 2018
      Irtza S; Sethu V; Ambikairajah E; Li H, 2018, 'Using language cluster models in hierarchical language identification', Speech Communication, 100, pp. 30 - 40, http://dx.doi.org/10.1016/j.specom.2018.04.004
      Journal articles | 2018
      Ma J; Sethu V; Ambikairajah E; Lee KA, 2018, 'Generalized variability model for speaker verification', IEEE Signal Processing Letters, 25, pp. 1775 - 1779, http://dx.doi.org/10.1109/LSP.2018.2874814
      Journal articles | 2017
      Ma J; Sethu V; Ambikairajah E; Lee KA, 2017, 'Duration compensation of i-vectors for short duration speaker verification', Electronics Letters, 53, pp. 405 - 407, http://dx.doi.org/10.1049/el.2016.4629
      Journal articles | 2017
      Sriskandaraja K; Sethu V; Ambikairajah E; Li H, 2017, 'Front-end for antispoofing countermeasures in speaker verification: Scattering spectral decomposition', IEEE Journal on Selected Topics in Signal Processing, 11, pp. 632 - 643, http://dx.doi.org/10.1109/JSTSP.2016.2647202
      Journal articles | 2015
      Cummins N; Sethu V; Epps J; Schnieder S; Krajewski J, 2015, 'Analysis of acoustic space variability in speech affected by depression', Speech Communication, 75, pp. 27 - 49, http://dx.doi.org/10.1016/j.specom.2015.09.003
      Journal articles | 2015
      Thiruvaran T; Sethu V; Ambikairajah E; Li H, 2015, 'Spectral shifting of speaker-specific information for narrow band telephonic speaker recognition', Electronics Letters, http://dx.doi.org/10.1049/el.2015.3117
      Journal articles | 2013
      Sethu V; Ambikairajah E; Epps J, 2013, 'On the use of speech parameter contours for emotion recognition', Eurasip Journal on Audio Speech and Music Processing, 2013, http://dx.doi.org/10.1186/1687-4722-2013-19
      Journal articles | 2011
      Ambikairajah E; Li H; Wang L; Yin B; Sethu V, 2011, 'Language Identification: A Tutorial', Circuits and Systems Magazine, IEEE, 11, pp. 82 - 108, http://dx.doi.org/10.1109/MCAS.2011.941081
      Journal articles | 2011
      Le NP; Ambikairajah E; Epps JR; Sethu V; Choi E, 2011, 'Investigation of spectral centroid features for cognitive load classification', Speech Communication, 53, pp. 540 - 551, http://dx.doi.org/10.1016/j.specom.2011.01.005
      Journal articles | 2008
      Sethu V; Ambikairajah E; Ge L, 2008, 'Selective weighting of undecimated wavelet coefficients for noise reduction in SAR interferograms', Eurasip Journal on Advances In Signal Processing, pp. 78092 - 78099
      Journal articles | 2007
      Meng D; Sethu V; Ambikairajah E; Ge L, 2007, 'A novel technique for noise reduction in InSAR images', IEEE Geoscience and Remote Sensing Letters, 4, pp. 226 - 230, http://dx.doi.org/10.1109/LGRS.2006.888845
    • Working Papers | 2024
      Nan Z; Dang T; Sethu V; Ahmed B, 2024, A Joint Spectro-Temporal Relational Thinking Based Acoustic Modeling Framework., http://dx.doi.org
      Working Papers | 2023
      Dimitriadis A; Pan S; Sethu V; Ahmed B, 2023, Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio., http://dx.doi.org
    • Conference Papers | 2025
      Ambikairajah E; Sirojan T; Sethu V, 2025, 'Tiered Assessment for DSP Education: Exploring Students' Motivation and Performance', in 2025 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), IEEE, pp. 1847 - 1852, presented at 2025 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 22 October 2025 - 24 October 2025, http://dx.doi.org/10.1109/apsipaasc65261.2025.11249035
      Conference Papers | 2025
      Ambikairajah E; Wu J; Dang T; Sethu V, 2025, 'A Study of Speech Embedding Similarities Between Australian Aboriginal and High-Resource Languages', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 1498 - 1502, http://dx.doi.org/10.21437/Interspeech.2025-911
      Conference Papers | 2025
      Dang T; Jeyaseelan TM; Ambikairajah E; Sethu V, 2025, 'Characterization of Speech Similarity Between Australian Aboriginal and High-Resource Languages: A Case Study on Dharawal', in 2025 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), IEEE, pp. 658 - 663, presented at 2025 Asia Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA ASC), 22 October 2025 - 24 October 2025, http://dx.doi.org/10.1109/apsipaasc65261.2025.11249066
      Preprints | 2025
      Dang T; Jeyaseelan TM; Ambikairajah E; Sethu V, 2025, Characterization of Speech Similarity Between Australian Aboriginal and High-Resource Languages: A Case Study on Dharawal, http://dx.doi.org/10.48550/arxiv.2509.01419
      Conference Papers | 2025
      Hong X; Gong Y; Sethu V; Dang T, 2025, 'AER-LLM: Ambiguity-aware Emotion Recognition Leveraging Large Language Models', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, http://dx.doi.org/10.1109/ICASSP49660.2025.10888198
      Conference Papers | 2025
      Hong X; Gong Y; Sethu V; Dang T, 2025, 'AER-LLM: Ambiguity-aware Emotion Recognition Leveraging Large Language Models.', in ICASSP, IEEE, pp. 1 - 5, https://doi.org/10.1109/ICASSP49660.2025
      Conference Papers | 2025
      Jing M; Sethu V; Ahmed B, 2025, 'Evidential Neural GPLDA: A Novel Approach to Quantify Prediction Uncertainty in Speaker Verification Systems', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, http://dx.doi.org/10.1109/ICASSP49660.2025.10887887
      Conference Papers | 2025
      Jing M; Sethu V; Ahmed B, 2025, 'Improved Out-of-domain Detection in VAE Latent Spaces with Boundary-driven Regularisation', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, http://dx.doi.org/10.1109/ICASSP49660.2025.10890806
      Conference Papers | 2025
      Meng H; Breebaart J; Stoddard J; Sethu V; Ambikairajah E, 2025, 'Blind Estimation of Sub-band Acoustic Parameters from Ambisonics Recordings using Spectro-Spatial Covariance Features', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, http://dx.doi.org/10.1109/ICASSP49660.2025.10887842
      Conference Papers | 2025
      Meng H; Breebaart J; Stoddard J; Sethu V; Ambikairajah E, 2025, 'Blind Estimation of Sub-band Acoustic Parameters from Ambisonics Recordings using Spectro-Spatial Covariance Features.', in ICASSP, IEEE, pp. 1 - 5, https://doi.org/10.1109/ICASSP49660.2025
      Preprints | 2025
      Meng H; Sethu V; Ambikairajah E; Zhang Q; Li H, 2025, Adaptive Per-Channel Energy Normalization Front-end for Robust Audio Signal Processing, http://dx.doi.org/10.48550/arxiv.2510.18206
      Preprints | 2025
      Zhang Q; Wickramasinghe B; Ambikairajah E; Sethu V; Li H, 2025, Should Audio Front-ends be Adaptive? Comparing Learnable and Adaptive Front-ends, http://dx.doi.org/10.48550/arxiv.2502.03260
      Conference Papers | 2024
      Ambikairajah E; Sirojan T; Sethu V; Mishra D, 2024, 'Aligning Tiered Assessments With Course Learning Outcomes', in 2024 IEEE International Conference on Teaching Assessment and Learning for Engineering Tale 2024 Proceedings, http://dx.doi.org/10.1109/TALE62452.2024.10834314
      Conference Papers | 2024
      Ambikairajah E; Sirojan T; Thiruvaran T; Sethu V, 2024, 'ChatGPT in the Classroom: A Shift in Engineering Design Education', in IEEE Global Engineering Education Conference Educon, http://dx.doi.org/10.1109/EDUCON60312.2024.10578884
      Conference Papers | 2024
      Ambikairajah E; Thiruvaran T; Sethu V; Mishra D; Sirojan T, 2024, 'A Tiered Learning Framework for Self-Guided Engineering Design Education', in IEEE Global Engineering Education Conference Educon, http://dx.doi.org/10.1109/EDUCON60312.2024.10578840
      Preprints | 2024
      Hong X; Gong Y; Sethu V; Dang T, 2024, AER-LLM: Ambiguity-aware Emotion Recognition Leveraging Large Language Models, http://dx.doi.org/10.48550/arxiv.2409.18339
      Conference Papers | 2024
      Jing M; Sethu V; Ahmed B, 2024, 'A PROBABILITY GRADIENT BASED APPROACH FOR SAMPLING BOUNDARIES OF IN-DOMAIN DATA', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 5340 - 5344, http://dx.doi.org/10.1109/ICASSP48485.2024.10445872
      Preprints | 2024
      Meng H; Breebaart J; Stoddard J; Sethu V; Ambikairajah E, 2024, Blind Estimation of Sub-band Acoustic Parameters from Ambisonics Recordings using Spectro-Spatial Covariance Features, http://arxiv.org/abs/2411.03172v2
      Preprints | 2024
      Meng H; Sethu V; Ambikairajah E, 2024, What is Learnt by the LEArnable Front-end (LEAF)? Adapting Per-Channel Energy Normalisation (PCEN) to Noisy Conditions, http://dx.doi.org/10.21437/Interspeech.2023-1617
      Conference Papers | 2024
      Meng H; Zhang Q; Zhang X; Sethu V; Ambikairajah E, 2024, 'Binaural Selective Attention Model for Target Speaker Extraction', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 4323 - 4327, http://dx.doi.org/10.21437/Interspeech.2024-683
      Conference Papers | 2024
      Meng H; Zhang Q; Zhang X; Sethu V; Ambikairajah E, 2024, 'Binaural Selective Attention Model for Target Speaker Extraction.', in Lapidot I; Gannot S (ed.), INTERSPEECH, ISCA, https://doi.org/10.21437/Interspeech.2024
      Preprints | 2024
      Meng H; Zhang Q; Zhang X; Sethu V; Ambikairajah E, 2024, Binaural Selective Attention Model for Target Speaker Extraction, http://arxiv.org/abs/2406.12236v1
      Conference Papers | 2024
      Nan Z; Dang T; Sethu V; Ahmed B, 2024, 'Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 6495 - 6499, http://dx.doi.org/10.1109/ICASSP48485.2024.10447530
      Conference Papers | 2024
      Nan Z; Dang T; Sethu V; Ahmed B, 2024, 'Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling.', in ICASSP, IEEE, pp. 6495 - 6499, https://doi.org/10.1109/ICASSP48485.2024
      Preprints | 2024
      Nan Z; Dang T; Sethu V; Ahmed B, 2024, A Joint Spectro-Temporal Relational Thinking Based Acoustic Modeling Framework, http://dx.doi.org/10.48550/arxiv.2409.15357
      Conference Papers | 2024
      Wu J; Dang T; Sethu V; Ambikairajah E, 2024, 'Dual-Constrained Dynamical Neural ODEs for Ambiguity-aware Continuous Emotion Prediction.', in Lapidot I; Gannot S (ed.), INTERSPEECH, ISCA, https://doi.org/10.21437/Interspeech.2024
      Conference Papers | 2024
      Wu J; Dang T; Sethu V; Ambikairajah E, 2024, 'Emotion Recognition Systems Must Embrace Ambiguity', in Proceedings 2024 12th International Conference on Affective Computing and Intelligent Interaction Workshops and Demos Aciiw 2024, pp. 166 - 170, http://dx.doi.org/10.1109/ACIIW63320.2024.00033
      Preprints | 2024
      Wu J; Dang T; Sethu V; Ambikairajah E, 2024, Dual-Constrained Dynamical Neural ODEs for Ambiguity-aware Continuous Emotion Prediction, http://dx.doi.org/10.48550/arxiv.2407.21344
      Conference Papers | 2024
      Wu YT; Wu J; Sethu V; Lee CC, 2024, 'Can Modelling Inter-Rater Ambiguity Lead To Noise-Robust Continuous Emotion Predictions?', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 3714 - 3718, http://dx.doi.org/10.21437/Interspeech.2024-482
      Conference Papers | 2023
      Dang T; Dimitriadis A; Wu J; Sethu V; Ambikairajah E, 2023, 'Constrained Dynamical Neural ODE for Time Series Modelling: A Case Study on Continuous Emotion Prediction', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, http://dx.doi.org/10.1109/ICASSP49357.2023.10095778
      Preprints | 2023
      Dimitriadis A; Pan S; Sethu V; Ahmed B, 2023, Spatial HuBERT: Self-supervised Spatial Speech Representation Learning for a Single Talker from Multi-channel Audio, http://dx.doi.org/10.48550/arxiv.2310.10922
      Conference Papers | 2023
      Meng H; Sethu V; Ambikairajah E, 2023, 'What is Learnt by the LEArnable Front-end (LEAF)? Adapting Per-Channel Energy Normalisation (PCEN) to Noisy Conditions', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 2898 - 2902, http://dx.doi.org/10.21437/Interspeech.2023-1617
      Conference Papers | 2023
      Meng H; Sethu V; Ambikairajah E, 2023, 'What is Learnt by the LEArnable Front-end (LEAF)? Adapting Per-Channel Energy Normalisation (PCEN) to Noisy Conditions.', in Harte N; Carson-Berndsen J; Jones G (eds.), INTERSPEECH, ISCA, pp. 2898 - 2902, https://doi.org/10.21437/Interspeech.2023
      Preprints | 2023
      Nan Z; Dang T; Sethu V; Ahmed B, 2023, Variational Connectionist Temporal Classification for Order-Preserving Sequence Modeling, http://dx.doi.org/10.48550/arxiv.2309.11983
      Conference Papers | 2023
      Shahin M; Nan Z; Sethu V; Ahmed B, 2023, 'Improving wav2vec2-based Spoken Language Identification by Learning Phonological Features', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 4119 - 4123, http://dx.doi.org/10.21437/Interspeech.2023-2533
      Conference Papers | 2023
      Wu J; Dang T; Sethu V; Ambikairajah E, 2023, 'Belief Mismatch Coefficient (BMC): A Novel Interpretable Measure of Prediction Accuracy for Ambiguous Emotion States', in 2023 11th International Conference on Affective Computing and Intelligent Interaction Acii 2023, http://dx.doi.org/10.1109/ACII59096.2023.10388210
      Conference Papers | 2023
      Wu J; Dang T; Sethu V; Ambikairajah E, 2023, 'From Interval to Ordinal: A HMM based Approach for Emotion Label Conversion', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 1843 - 1847, http://dx.doi.org/10.21437/Interspeech.2023-2213
      Conference Papers | 2022
      Wu J; Dang T; Sethu V; Ambikairajah E, 2022, 'A NOVEL SEQUENTIAL MONTE CARLO FRAMEWORK FOR PREDICTING AMBIGUOUS EMOTION STATES', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 8567 - 8571, http://dx.doi.org/10.1109/ICASSP43922.2022.9746350
      Conference Papers | 2021
      Ahmed B; Ballard K; Burnham D; Sirojan T; Mehmood H; Estival D; Baker E; Cox F; Arciuli J; Benders T; Demuth K; Kelly B; Diskin-Holdaway C; Shahin M; Sethu V; Epps J; Lee CB; Ambikairajah E, 2021, 'AusKidTalk: An auditory-visual corpus of 3-to 12-year-old Australian children's speech', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 4351 - 4355, http://dx.doi.org/10.21437/Interspeech.2021-2000
      Conference Papers | 2021
      Ahmed B; Ballard KJ; Burnham D; Sirojan T; Mehmood H; Estival D; Baker E; Cox F; Arciuli J; Benders T; Demuth K; Kelly B; Diskin-Holdaway C; Shahin MA; Sethu V; Epps J; Lee CB; Ambikairajah E, 2021, 'AusKidTalk: An Auditory-Visual Corpus of 3- to 12-Year-Old Australian Children's Speech.', in Hermansky H; Cernocký H; Burget L; Lamel L; Scharenborg O; Motlícek P (eds.), Interspeech, ISCA, pp. 3680 - 3684, https://doi.org/10.21437/Interspeech.2021
      Conference Papers | 2021
      Bose D; Sethu V; Ambikairajah E, 2021, 'Parametric Distributions to Model Numerical Emotion Labels', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 576 - 580, http://dx.doi.org/10.21437/Interspeech.2021-1000
      Conference Papers | 2021
      Bose D; Sethu V; Ambikairajah E, 2021, 'Parametric Distributions to Model Numerical Emotion Labels.', in Hermansky H; Cernocký H; Burget L; Lamel L; Scharenborg O; Motlícek P (eds.), Interspeech, ISCA, pp. 4498 - 4502, https://doi.org/10.21437/Interspeech.2021
      Preprints | 2021
      Dang T; Sethu V; Ambikairajah E; Epps J; Li H, 2021, Joint Spatio-Temporal Discretisation of Nonlinear Active Cochlear Models, http://dx.doi.org/10.48550/arxiv.2108.05993
      Preprints | 2021
      Wu J; Dang T; Sethu V; Ambikairajah E, 2021, A Novel Markovian Framework for Integrating Absolute and Relative Ordinal Emotion Information, http://dx.doi.org/10.48550/arxiv.2108.04605
      Conference Papers | 2020
      Ambikairajah E; Sethu V, 2020, 'Cochlear Signal Processing: A Platform for Learning the Fundamentals of Digital Signal Processing', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 9229 - 9233, http://dx.doi.org/10.1109/ICASSP40776.2020.9054297
      Conference Papers | 2020
      Suthokumar G; Sethu V; Sriskandaraja K; Ambikairajah E, 2020, 'Adversarial Multi-Task Learning for Speaker Normalization in Replay Detection', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 6609 - 6613, http://dx.doi.org/10.1109/ICASSP40776.2020.9054322
      Conference Papers | 2019
      Atcheson M; Sethu V; Epps J, 2019, 'Using Gaussian Processes with LSTM Neural Networks to Predict Continuous-Time, Dimensional Emotion in Ambiguous Speech', in 2019 8th International Conference on Affective Computing and Intelligent Interaction Acii 2019, http://dx.doi.org/10.1109/ACII.2019.8925450
      Conference Papers | 2019
      Bose D; Dang T; Sethu V; Ambikairajah E; Fernando S, 2019, 'A Novel Bag-of-Optimised-Clusters Front-End for Speech based Continuous Emotion Prediction', in 2019 8th International Conference on Affective Computing and Intelligent Interaction Acii 2019, http://dx.doi.org/10.1109/ACII.2019.8925490
      Conference Papers | 2019
      Ouyang A; Dang T; Sethu V; Ambikairajah E, 2019, 'Speech based emotion prediction: Can a linear model work?', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, ISCA, Graz, Austria, pp. 2813 - 2817, presented at INTERSPEECH 2019, Graz, Austria, 15 September 2019 - 19 September 2019, http://dx.doi.org/10.21437/Interspeech.2019-3149
      Preprints | 2019
      Sethu V; Provost EM; Epps J; Busso C; Cummins N; Narayanan S, 2019, The Ambiguous World of Emotion Representation, http://dx.doi.org/10.48550/arxiv.1909.00360
      Conference Papers | 2019
      Suthokumar G; Sriskandaraja K; Sethu V; Wijenayake C; Ambikairajah E, 2019, 'Phoneme Specific Modelling and Scoring Techniques for Anti Spoofing System', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 6106 - 6110, http://dx.doi.org/10.1109/ICASSP.2019.8682411
      Conference Papers | 2019
      Wickramasinghe B; Ambikairajah E; Epps J; Sethu V; Li H, 2019, 'Auditory Inspired Spatial Differentiation for Replay Spoofing Attack Detection', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 6011 - 6015, http://dx.doi.org/10.1109/ICASSP.2019.8683693
      Conference Papers | 2018
      Atcheson M; Sethu V; Epps J, 2018, 'Demonstrating and modelling systematic time-varying annotator disagreement in continuous emotion annotation', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 3668 - 3672, http://dx.doi.org/10.21437/Interspeech.2018-1933
      Conference Papers | 2018
      Dang T; Sethu V; Ambikairajah E, 2018, 'Dynamic Multi-Rater Gaussian Mixture Regression Incorporating Temporal Dependencies of Emotion Uncertainty Using Kalman Filters', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 4929 - 4933, http://dx.doi.org/10.1109/ICASSP.2018.8461321
      Conference Papers | 2018
      Fernando S; Irtza S; Sethu V; Ambikairajah E, 2018, 'Advances in Feature Extraction and Modelling for Short Duration Language Identification', in 2018 IEEE 9th International Conference on Information and Automation for Sustainability Iciafs 2018, http://dx.doi.org/10.1109/ICIAFS.2018.8913386
      Conference Papers | 2018
      Fernando S; Sethu V; Ambikairajah E; Li H, 2018, 'Second Order Factorized Model Adaptation for Short Duration Language Identification', in 2018 Asia Pacific Signal and Information Processing Association Annual Summit and Conference Apsipa ASC 2018 Proceedings, pp. 1440 - 1447, http://dx.doi.org/10.23919/APSIPA.2018.8659586
      Conference Papers | 2018
      Fernando S; Sethu V; Ambikairajah E, 2018, 'Factorized Hidden Variability Learning for Adaptation of Short Duration Language Identification Models', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 5204 - 5208, http://dx.doi.org/10.1109/ICASSP.2018.8462094
      Conference Papers | 2018
      Fernando S; Sethu V; Ambikairajah E, 2018, 'Sub-band envelope features using frequency domain linear prediction for short duration language identification', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 1818 - 1822, http://dx.doi.org/10.21437/Interspeech.2018-1805
      Conference Papers | 2018
      Gamage KW; Dang T; Sethu V; Epps J; Ambikairajah E, 2018, 'Speech-based Continuous Emotion Prediction by Learning Perception Responses related to Salient Events: A Study based on Vocal Affect Bursts and Cross-Cultural Affect in AVEC 2018', in Avec 2018 Proceedings of the 2018 Audio Visual Emotion Challenge and Workshop Co Located with mm 2018, pp. 47 - 55, http://dx.doi.org/10.1145/3266302.3266314
      Conference Papers | 2018
      Irtza S; Sethu V; Ambikairajah E; Li H, 2018, 'End-to-End Hierarchical Language Identification System', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 5199 - 5203, http://dx.doi.org/10.1109/ICASSP.2018.8461419
      Conference Papers | 2018
      Ma J; Sethu V; Ambikairajah E; Lee KA, 2018, 'Speaker-Phonetic Vector Estimation for Short Duration Speaker Verification', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 5264 - 5268, http://dx.doi.org/10.1109/ICASSP.2018.8461978
      Conference Papers | 2018
      Sriskandaraja K; Sethu V; Ambikairajah E, 2018, 'Deep Siamese architecture based replay detection for secure voice biometric', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 671 - 675, http://dx.doi.org/10.21437/Interspeech.2018-1819
      Conference Papers | 2018
      Suthokumar G; Sethu V; Wijenayake C; Ambikairajah E, 2018, 'Modulation dynamic features for the detection of replay attacks', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 691 - 695, http://dx.doi.org/10.21437/Interspeech.2018-1846
      Conference Papers | 2018
      Suthokumar G; Sriskandaraja K; Sethu V; Wijenayake C; Ambikairajah E; Li H, 2018, 'Use of Claimed Speaker Models for Replay Detection', in 2018 Asia Pacific Signal and Information Processing Association Annual Summit and Conference Apsipa ASC 2018 Proceedings, pp. 1038 - 1046, http://dx.doi.org/10.23919/APSIPA.2018.8659510
      Conference Papers | 2018
      Suthokumar G; Sriskandaraja K; Sethu V; Wijenayake C; Ambikairajah E, 2018, 'An Investigation about the Scalability of the Spoofing Detection System', in 2018 IEEE 9th International Conference on Information and Automation for Sustainability Iciafs 2018, http://dx.doi.org/10.1109/ICIAFS.2018.8913369
      Conference Papers | 2017
      Atcheson M; Sethu V; Epps J, 2017, 'Gaussian Process Regression for Continuous Emotion Recognition with Global Temporal Invariance.', in Lawrence N; Reid M (ed.), AffComp@IJCAI, PMLR, pp. 34 - 44, http://proceedings.mlr.press/v66/
      Conference Papers | 2017
      Cetin E; Abewardana Wijenayake C; Sethu V; Ambikairajah E, 2017, 'A Flipped Mode Approach to Teaching an Electronic System Design Course', in PROCEEDINGS OF 2017 IEEE 6TH INTERNATIONAL CONFERENCE ON TEACHING, ASSESSMENT, AND LEARNING FOR ENGINEERING (TALE), IEEE, Hong Kong, pp. 223 - 228, presented at IEEE International Conference on Teaching, Assessment, and Learning for Engineering, Hong Kong, 12 December 2017 - 14 December 2017, http://dx.doi.org/10.1109/TALE.2017.8252337
      Conference Papers | 2017
      Dang T; Atcheson M; Stasak B; Hayat M; Goecke R; Huang Z; Le P; Epps J; Jayawardena S; Sethu V, 2017, 'Investigating word affect features and fusion of probabilistic predictions incorporating uncertainty in AVEC 2017', in Ringeval F; Schuller BW; Valstar MF; Gratch J; Cowie R; Pantic M (eds.), AVEC 2017 - Proceedings of the 7th Annual Workshop on Audio/Visual Emotion Challenge, co-located with MM 2017, Association for Computing Machinery (ACM), Mountain View, California, USA, pp. 27 - 35, presented at 7th Annual Workshop on Audio/Visual Emotion Challenge, Mountain View, California, USA, 23 October 2017 - 23 October 2017, http://dx.doi.org/10.1145/3133944.3133952
      Conference Papers | 2017
      Dang T; Sethu V; Epps J; Ambikairajah E, 2017, 'An investigation of emotion prediction uncertainty using Gaussian Mixture Regression', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 1248 - 1252, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-512
      Conference Papers | 2017
      Fernando S; Sethu V; Ambikairajah E; Epps J, 2017, 'Bidirectional modelling for short duration language identification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 2809 - 2813, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-286
      Conference Papers | 2017
      Gamage KW; Sethu V; Ambikairajah E, 2017, 'Modeling variable length phoneme sequences - A step towards linguistic information for speech emotion recognition in wider world', in 2017 7th International Conference on Affective Computing and Intelligent Interaction Acii 2017, pp. 518 - 523, http://dx.doi.org/10.1109/ACII.2017.8273648
      Conference Papers | 2017
      Gamage KW; Sethu V; Ambikairajah E, 2017, 'Salience based lexical features for emotion recognition', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 5830 - 5834, http://dx.doi.org/10.1109/ICASSP.2017.7953274
      Conference Papers | 2017
      Irtza S; Sethu V; Ambikairajah E; Li H, 2017, 'Investigating scalability in hierarchical language identification system', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 2581 - 2585, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-596
      Conference Papers | 2017
      Lee KA; Hautamäki V; Kinnunen T; Larcher A; Zhang C; Nautsch A; Stafylakis T; Liu G; Rouvier M; Rao W; Alegre F; Ma J; Mak MW; Sarkar AK; Delgado H; Saeidi R; Aronowitz H; Sizov A; Sun H; Nguyen TH; Wang G; Ma B; Vestman V; Sahidullah M; Halonen M; Kanervisto A; Le Lan G; Bahmaninezhad F; Isadskiy S; Rathgeb C; Busch C; Tzimiropoulos G; Qian Q; Wang Z; Zhao Q; Wang T; Li H; Xue J; Zhu S; Jin R; Zhao T; Bousquet PM; Ajili M; Kheder WB; Matrouf D; Lim ZH; Xu C; Xu H; Xiao X; Chng ES; Fauve B; Sriskandaraja K; Sethu V; Lin WW; Thomsen DAL; Tan ZH; Todisco M; Evans N; Li H; Hansen JHL; Bonastre JF; Ambikairajah E, 2017, 'The I4U mega fusion and collaboration for NIST speaker recognition evaluation 2016', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 1328 - 1332, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-203
      Conference Papers | 2017
      Ma J; Sethu V; Ambikairajah E; Lee KA, 2017, 'Incorporating local acoustic variability information into short duration speaker verification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 1502 - 1506, presented at Interspeech 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-266
      Conference Papers | 2017
      Sriskandaraja K; Suthokumar G; Sethu V; Ambikairajah E, 2017, 'Investigating the use of scattering coefficients for replay attack detection', in Proceedings 9th Asia Pacific Signal and Information Processing Association Annual Summit and Conference Apsipa ASC 2017, pp. 1195 - 1198, http://dx.doi.org/10.1109/APSIPA.2017.8282211
      Conference Papers | 2017
      Suthokumar G; Sriskandaraja K; Sethu V; Wijenayake C; Ambikairajah E, 2017, 'Independent modelling of high and low energy speech frames for spoofing detection', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, Stockholm, Sweden, pp. 2606 - 2610, presented at INTERSPEECH 2017, Stockholm, Sweden, 20 August 2017 - 24 August 2017, http://dx.doi.org/10.21437/Interspeech.2017-836
      Conference Papers | 2016
      Dang T; Sethu V; Ambikairajah E, 2016, 'Factor analysis based speaker normalisation for continuous emotion prediction', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 913 - 917, http://dx.doi.org/10.21437/interspeech.2016-880
      Conference Papers | 2016
      Fernando S; Sethu V; Ambikairajah E, 2016, 'A feature normalisation technique for PLLR based language identification systems', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, San Francisco, CA, USA, pp. 2925 - 2929, presented at Interspeech 2016, San Francisco, CA, USA, 08 September 2016 - 12 September 2016, http://dx.doi.org/10.21437/Interspeech.2016-560
      Conference Papers | 2016
      Huang Z; Stasak B; Dang T; Gamage KW; Le P; Sethu V; Epps J, 2016, 'Staircase regression in OA RVM, data selection and gender dependency in AVEC 2016', in AVEC 2016 - Proceedings of the 6th International Workshop on Audio/Visual Emotion Challenge, co-located with ACM Multimedia 2016, ASSOC COMPUTING MACHINERY, Amsterdam, NETHERLANDS, pp. 19 - 26, presented at 6th International Workshop on Audio-Visual Emotion Recognition Challenge - Depression, Mood, and Emotion (AVEC), Amsterdam, NETHERLANDS, 16 October 2016 - 16 October 2016, http://dx.doi.org/10.1145/2988257.2988265
      Conference Papers | 2016
      Irtza S; Sethu V; Bavattichalil H; Ambikairajah E; Li H, 2016, 'A hierarchical framework for language identification', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Shanghai, China, pp. 5820 - 5824, presented at 2016 IEEE International Conference on, Shanghai, China, 20 March 2016 - 25 March 2016, http://dx.doi.org/10.1109/ICASSP.2016.7472793
      Conference Papers | 2016
      Irtza S; Sethu V; Fernando S; Ambikairajah E; Li H, 2016, 'Out of set language modelling in Hierarchical language identification', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 3270 - 3274, http://dx.doi.org/10.21437/interspeech.2016-558
      Conference Papers | 2016
      Ma J; Irtza S; Sriskandaraja K; Sethu V; Ambikairajah E, 2016, 'Parallel speaker and content modelling for text-dependent speaker verification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, San Francisco, USA, pp. 435 - 439, presented at Interspeech 2016, San Francisco, USA, 08 September 2016 - 12 September 2016, http://dx.doi.org/10.21437/Interspeech.2016-825
      Conference Papers | 2016
      Ma J; Sethu V; Ambikairajah E; Lee KA, 2016, 'Twin model G-PLDA for duration mismatch compensation in text-independent speaker verification', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, San Francisco, USA, pp. 1853 - 1857, presented at Interspeech 2016, San Francisco, USA, 08 September 2016 - 12 September 2016, http://dx.doi.org/10.21437/Interspeech.2016-683
      Conference Papers | 2016
      Sethu V; Fernando S; Ambikairajah E, 2016, 'Eigenfeatures: An alternative to Shifted Delta Coefficients for Language Identification', in SST2016, ASSTA, Parramatta, Australia, pp. 253 - 256, presented at 16th Speech Science and Technology Conference (SST2016), Parramatta, Australia, 06 December 2016 - 09 December 2017, https://www.researchgate.net/publication/311615271_Eigenfeatures_An_alternative_to_Shifted_Delta_Coefficients_for_Language_Identification
      Conference Papers | 2016
      Sriskandaraja K; Sethu V; Le PN; Ambikairajah E, 2016, 'Investigation of sub-band discriminative information between spoofed and genuine speech', in Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH, San Francisco, USA, pp. 1710 - 1714, presented at Interspeech 2016, San Francisco, USA, 08 September 2016 - 12 September 2016, http://dx.doi.org/10.21437/Interspeech.2016-844
      Conference Papers | 2015
      Cummins N; Epps J; Sethu V; Krajewski J, 2015, 'Weighted pairwise Gaussian likelihood regression for depression score prediction', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 4779 - 4783, http://dx.doi.org/10.1109/ICASSP.2015.7178878
      Conference Papers | 2015
      Cummins N; Sethu V; Epps J; Krajewski J, 2015, 'Relevance Vector Machine for Depression Prediction', in Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, presented at Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 06 September 2015 - 10 September 2015, http://www.isca-speech.org/archive/interspeech_2015/i15_0110.html
      Conference Papers | 2015
      Epps J; Sethu V; Eaton R; Ambikairajah E, 2015, 'High Definition Multi-View Video Guidance for Self-Directed Learning and More Effective Engineering Laboratories', Geelong,Australia, presented at Australasian Association for Engineering Education, Geelong,Australia, 06 December 2015 - 09 December 2015, https://aaee2015conference.sched.org/event/5aaZ/4b-high-definition-multi-view-video-guidance-for-self-directed-learning-and-more-effective-engineering-laboratories
      Conference Papers | 2015
      Gamage KW; Sethu V; Le P; Ambikairajah E, 2015, 'An i-vector GPLDA System for Speech based Emotion Recognition', in 2015 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA), Hong Kong, presented at The proceedings of 7th Asia-Pacific Signal and Information Processing Association Conference (APSIPA), Hong Kong, 16 December 2015 - 19 December 2015, http://dx.doi.org/10.1109/APSIPA.2015.7415522
      Conference Papers | 2015
      Hines C; Sethu V; Epps J, 2015, 'Twitter: A new online source of automatically tagged data for conversational speech emotion recognition', in ASM 2015 Proceedings of the 1st International Workshop on Affect and Sentiment in Multimedia Co Located with ACM mm 2015, pp. 9 - 14, http://dx.doi.org/10.1145/2813524.2813529
      Conference Papers | 2015
      Huang Z; Dang T; Cummins N; Stasak B; Le P; Sethu V; Epps J, 2015, 'An investigation of annotation delay compensation and output-associative fusion for multimodal continuous emotion prediction', in Avec 2015 Proceedings of the 5th International Workshop on Audio Visual Emotion Challenge Co Located with mm 2015, pp. 41 - 48, http://dx.doi.org/10.1145/2808196.2811640
      Conference Papers | 2015
      Irtza S; Bavattichalil H; Sethu V; Ambikairajah E, 2015, 'Scalable I-vector Concatenation for PLDA based Language Identification System', in The proceedings of 7th Asia-Pacific Signal and Information Processing Association Conference (APSIPA), Hong Kong, presented at The proceedings of 7th Asia-Pacific Signal and Information Processing Association Conference (APSIPA), Hong Kong, 16 December 2015 - 19 December 2015, http://ieeexplore.ieee.org/xpls/abs_all.jsp?arnumber=7415458
      Conference Papers | 2015
      Irtza S; Sethu V; Le P; Ambikairajah E; Li H, 2015, 'Phonemes Frequency Based PLLR Dimensionality Reduction for Language Recognition', Dresden, Germany, presented at In Sixteenth Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 06 September 2015 - 10 September 2015
      Conference Papers | 2015
      Khlif A; Sethu V, 2015, 'An iterative multi range non-negative matrix factorization algorithm for polyphonic music transcription', in Proceedings of the 16th International Society for Music Information Retrieval Conference Ismir 2015, pp. 330 - 335
      Conference Papers | 2015
      Sriskandaraja K; Sethu V; Le P; Ambikairajah E, 2015, 'A Model Based Voice Activity Detector for Noisy Environments', Dresden, Germany, presented at Sixteenth Annual Conference of the International Speech Communication Association (Interspeech), Dresden, Germany, 06 September 2015 - 10 September 2015, http://www.isca-speech.org/archive/interspeech_2015/i15_2297.html
      Conference Papers | 2014
      Cummins N; Epps J; Sethu V; Krajewski J, 2014, 'Variability compensation in small data: Oversampled extraction of i-vectors for the classification of depressed speech', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 970 - 974, http://dx.doi.org/10.1109/ICASSP.2014.6853741
      Conference Papers | 2014
      Cummins N; Sethu V; Epps J; Krajewski J, 2014, 'Probabilistic acoustic volume analysis for speech affected by depression', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 1238 - 1242
      Conference Papers | 2014
      Kua JMK; Sethu V; Le P; Ambikairajah E, 2014, 'The UNSW submission to INTERSPEECH 2014 ComParE cognitive load challenge', in Proceedings of the Annual Conference of the International Speech Communication Association Interspeech, pp. 746 - 750
      Conference Papers | 2013
      Cummins N; Epps J; Sethu V; Breakspear M; Goecke R, 2013, 'Modeling Spectral Variability for the Classification of Depressed Speech', in INTERSPEECH 2013, 14thAnnual Conference of the International Speech Communication Association, Lyon, France, presented at 14th Annual Conference of the International Speech Communication Association Interspeech2013, Lyon, France, 25 August 2013 - 29 August 2013
      Conference Papers | 2013
      Cummins N; Joshi J; Dhall A; Sethu V; Goecke R; Epps J, 2013, 'Diagnosis of depression by behavioural signals: A multimodal approach', in Avec 2013 Proceedings of the 3rd ACM International Workshop on Audio Visual Emotion Challenge, pp. 11 - 20, http://dx.doi.org/10.1145/2512530.2512535
      Conference Papers | 2013
      Sethu V; Epps J; Ambikairajah E, 2013, 'GMM Based Speaker Variability Compensated System for Interspeech 2013 ComParE Emotion Challenge', in CERISARA C (ed.), INTERSPEECH 2013, 14thAnnual Conference of the International Speech Communication Association, Lyon, France, presented at INTERSPEECH 2013 14thAnnual Conference of the International Speech Communication Association, Lyon, France, 25 August 2013 - 29 August 2013
      Conference Papers | 2013
      Sethu V; Epps J; Ambikairajah E, 2013, 'Speaker variability in speech based emotion models - Analysis and normalisation', in ICASSP IEEE International Conference on Acoustics Speech and Signal Processing Proceedings, pp. 7522 - 7525, http://dx.doi.org/10.1109/ICASSP.2013.6639125
      Conference Papers | 2012
      Ambikairajah E; Kua JM; Sethu V; Li H, 2012, 'PNCC-ivector-SRC based Speaker Verification', in 2012 Conference Handbook - Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2012, APSIPA, Hollywood, California, USA, presented at Asia Pacific Signal and Information Processing Association, Hollywood, California, USA, 03 December 2012 - 06 December 2012
      Conference Papers | 2012
      Ding N; Sethu V; Epps JR; Ambikairajah E, 2012, 'Speaker variability in emotion recognition - An adaptation based approach', in ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, Institute of Electrical and Electronics Engineers Inc., Piscataway, NJ, pp. 5101 - 5104, presented at 2012 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2012, Kyoto, Japan, 25 March 2012 - 30 March 2012, http://dx.doi.org/10.1109/ICASSP.2012.6289068
      Conference Papers | 2011
      Le PN; Sethu V; Ambikairajah E; Kua JMK, 2011, 'Investigation of the robustness of a non-uniform filterbank for cognitive load classification', in Icics 2011 8th International Conference on Information Communications and Signal Processing, http://dx.doi.org/10.1109/ICICS.2011.6174268
      Conference Papers | 2010
      Ambikairajah E; Ibrahim RK; Sethu V, 2010, 'Novel delta zero crossing regression features for gait pattern classification', IEEE, Beunos Aires, presented at Proceedings of the 32nd Annual International Conference of the IEEE EMBS, Beunos Aires, 31 August 2010 - 04 September 2010
      Conference Papers | 2010
      Le NP; Epps JR; Ambikairajah E; Sethu V, 2010, 'Robust Speech-Based Cognitive Load Classification Using a Multi-band Approach', in The Proceedings of APSIPA ASC 2010, Asia-Pacific Signal Processing Association, Hong Kong, presented at Asia-Pacific Signal Processing Association Conf., Singapore, 14 December 2010 - 17 December 2010
      Conference Papers | 2009
      Sethu V; Ambikairajah E; Epps JR, 2009, 'Pitch Contour Prameterisation based on Linear Stylisation for Emotion Recognition', in Interspeech 2012, Curran Associates, Inc, Brighton, UK, presented at Interspeech 2009 Speech and Intelligence, Brighton, UK, 06 September 2009 - 10 September 2009
      Conference Papers | 2009
      Sethu V; Ambikairajah E; Epps JR, 2009, 'SPEAKER DEPENDENCY OF SPECTRAL FEATURES AND SPEECH PRODUCTION CUES FOR AUTOMATIC EMOTION CLASSIFICATION', in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, IEEE, USA, presented at ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing, Taipei, Taiwan, 19 April 2009 - 24 April 2009
      Conference Papers | 2009
      Sethu V; Ambikairajah E; Epps J, 2009, 'Pitch contour parameterisation based on linear stylisation for emotion recognition', in Interspeech 2009, ISCA, presented at Interspeech 2009, http://dx.doi.org/10.21437/interspeech.2009-579
      Conference Papers | 2008
      Le NP; Ambikairajah E; Sethu V, 2008, 'Speech enhancement based on empirical mode decomposition', in Modelling, Identification and Control 2008, Innsbruck, Austria, presented at 5th IASTED International Conference on Signal Processing, Pattern Recognition and Applications 2008, Innsbruck, Austria, 13 February 2008 - 15 February 2008
      Conference Papers | 2008
      Sethu V; Ambikairajah E; Epps JR, 2008, 'Empirical mode decomposition based weighted frequency feature for speech-based emotion classification', in IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP, IEEE, USA, presented at IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2008), 31 March 2008 - 04 April 2008
      Conference Papers | 2008
      Sethu V; Ambikairajah E; Epps JR, 2008, 'Phonetic and speaker variations in automatic emotion classification', in Interspeech 2012, Curran Associates, Inc, Brisbane Australia, presented at Interspeech 2008, Brisbane Australia, 22 September 2008 - 26 September 2008
      Conference Papers | 2007
      Sethu V; Ambikairajah E; Epps JR, 2007, 'Group Delay Features for Emotion Detection', in INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECHCOMMUNICATION ASSOCIATION, VOLS 1-4, Isca-Inst Speech Communication Assoc, Baixas
      Conference Papers | 2007
      Sethu V; Ambikairajah E; Epps JR, 2007, 'Speaker normalisation for speech-based emotion detection', in 2007 15th International Conference on Digital Signal Processing, Wales, UK, presented at 15th International Conference on Digital Signal Processing 2007, Wales, UK, 01 July 2007 - 04 July 2007
      Conference Papers | 2007
      Wang Y; An J; Sethu V; Ambikairajah E, 2007, 'Perceptually motivated pre-filter for speech enhancement using Kalman filtering', in 2007 6th International Conference on Information Communications and Signal Processing Icics, http://dx.doi.org/10.1109/ICICS.2007.4449758
      Conference Papers | 2007
      Sethu V; Ambikairajah E; Epps J, 2007, 'Group delay features for emotion detection', in Interspeech 2007, ISCA, presented at Interspeech 2007, http://dx.doi.org/10.21437/interspeech.2007-617
      Conference Papers | 2006
      Ambikairajah E; Sethu V; Ge L, 2006, 'Noise reduction in SAR interferograms using undecimated wavelet transform', in 2nd international symposium on Geo-information for Disaster Management, Goa, India, presented at 2nd international symposium on Geo-information for Disaster Management, Goa, India, 25 September 2006 - 26 September 2006

    • ARC Discovery Project (2020)
    • ARC Discovery Project (2019)
    • ARC LIEF Grant (2019)
    • UNSW Research Infrastructure (2019)
    • UNSW Faculty of Engineering Research Infrastructure (2018)
    • Huawei Innovation Research Program (2018)
    • UNSW SEIF Grant (2018)
    • ARC Linkage (2017)
    • UNSW Faculty of Engineering Silverstar (2016)
    • UNSW Strategic Educational Development Grant (2014)
    • NICTA International Postgraduate Award (2006-2009)

    Research Interests include:

    • Artificial Emotional Intelligence and Speech based Emotion Recognition
    • Computational models of cochlear signal processing
    • Speaker recognition/Voice biometrics
    • Application of machine learning to signal processing tasks

    My Teaching

    I currently teach or have previously taught the following courses at UNSW:
    • Data Science for Electrical Engineers (ELEC9741)
    • Speech Processing (ELEC9723)
    • Digital Signal Processing (ELEC3104)
    • Electrical Systems Design (ELEC2117)
    • Design Proficiency (ELEC/TELE/PHTN4123)