Dr Philip Jackson
Senior Lecturer in Machine Audition
Qualifications: MA, PhD
Email: p.jackson@surrey.ac.uk
Phone (work): 01483 68 6044
Room no: 30 AB 05
Further information
Biography
Further details can be found on my personal web page.
Publications
Highlights
- (2010) 'Development and Validation of an Unintrusive Model for Predicting the Sensation of Envelopment Arising from Surround Sound Recordings'. Journal of the Audio Engineering Society, 58 (12), pp. 1013-1031.
- (2009) 'Model-based synthesis of visual speech movements from 3D video'. Hindawi Publishing Corporation EURASIP Journal on Audio, Speech, and Music Processing, 2009 Article number 597267, pp. 12-12. doi: 10.1155/2009/597267. Full text is available at: http://epubs.surrey.ac.uk/7693/
- (2009) 'Statistical identification of articulation constraints in the production of speech'. ELSEVIER SCIENCE BV SPEECH COMMUN, 51 (8), pp. 695-710. Full text is available at: http://epubs.surrey.ac.uk/7694/
- (2008) 'Start- and end-node segmental-HMM pruning'. INST ENGINEERING TECHNOLOGY-IET ELECTRON LETT, 44 (1), pp. 60-U77. doi: 10.1049/el:20082233
Journal articles
- (2012) 'Performance of optimized sound field control techniques in simulated and real acoustic environments'. J Acoust Soc Am, 131 (4).
- (2012) 'Use of bimodal coherence to resolve the permutation problem in convolutive BSS'. Elsevier Signal Processing, 92 (8), pp. 1916-1927.
- (2011) 'Source localization and separation using random sample consensus with phase cues'. IEEE IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, pp. 337-340. Full text is available at: http://epubs.surrey.ac.uk/142839/
- (2011) 'Integrating binaural cues and blind source separation method for separating reverberant speech mixtures'. IEEE IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings, pp. 209-212. Full text is available at: http://epubs.surrey.ac.uk/7722/
- (2010) 'Development and Validation of an Unintrusive Model for Predicting the Sensation of Envelopment Arising from Surround Sound Recordings'. Journal of the Audio Engineering Society, 58 (12), pp. 1013-1031.
- (2010) 'Estimates of perceived spatial quality across the listening area'. Audio Engineering Society Proceedings of the AES International Conference, pp. 233-242. Full text is available at: http://epubs.surrey.ac.uk/7691/
- (2009) 'Model-based synthesis of visual speech movements from 3D video'. Hindawi Publishing Corporation EURASIP Journal on Audio, Speech, and Music Processing, 2009 Article number 597267, pp. 12-12. doi: 10.1155/2009/597267. Full text is available at: http://epubs.surrey.ac.uk/7693/
- (2009) 'Statistical identification of articulation constraints in the production of speech'. ELSEVIER SCIENCE BV SPEECH COMMUN, 51 (8), pp. 695-710. Full text is available at: http://epubs.surrey.ac.uk/7694/
- (2008) 'Start- and end-node segmental-HMM pruning'. INST ENGINEERING TECHNOLOGY-IET ELECTRON LETT, 44 (1), pp. 60-U77. doi: 10.1049/el:20082233
- (2007) 'Modelling speech signals using formant frequencies as an intermediate representation'. INST ENGINEERING TECHNOLOGY-IET IET SIGNAL PROCESS, 1 (1), pp. 43-50. Full text is available at: http://epubs.surrey.ac.uk/7744/
- (2006) 'Amplitude modulation of turbulence noise by voicing in fricatives'. ACOUSTICAL SOC AMER AMER INST PHYSICS J ACOUST SOC AM, 120 (6), pp. 3966-3977. doi: 10.1121/1.2358004. Full text is available at: http://epubs.surrey.ac.uk/7748/
- (2005) 'A multiple-level linear/linear segmental HMM with a formant-based intermediate layer'. ACADEMIC PRESS LTD ELSEVIER SCIENCE LTD COMPUT SPEECH LANG, 19 (2), pp. 205-225.
- (2005) 'Amplitude modulation of frication noise by voicing saturates'. 9th European Conference on Speech Communication and Technology, pp. 349-352. Full text is available at: http://epubs.surrey.ac.uk/7752/
- (2002) 'Data-driven, nonlinear, formant-to-acoustic mapping for ASR'. IEE-INST ELEC ENG ELECTRON LETT, 38 (13), pp. 667-669. doi: 10.1049/el:20020436. Full text is available at: http://epubs.surrey.ac.uk/7756/
- (2001) 'Pitch-scaled estimation of simultaneous voiced and turbulence-noise components in speech'. IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC IEEE T SPEECH AUDI P, 9 (7), pp. 713-726.
- (2000) 'Frication noise modulated by voicing, as revealed by pitch-scaled decomposition'. American Institute of Physics Journal of the Acoustical Society of America, USA: 108 (4), pp. 1421-1434. Full text is available at: http://epubs.surrey.ac.uk/7778/
Conference papers
- (2010) 'Audio-visual Convolutive Blind Source Separation'. London: Institution of Engineering and Technology 2010 Sensor Signal Processing for Defence Conference Proceedings (SSPD 2010), pp. 5-5. Full text is available at: http://epubs.surrey.ac.uk/7726/
- (2010) 'Use of Bimodal Coherence to Resolve Spectral Indeterminacy in Convolutive BSS'. Springer Lecture Notes in Computer Science (LNCS 6365), St. Malo, France: 9th International Conference on Latent Variable Analysis and Signal Separation (formerly the International Conference on Independent Component Analysis and Signal Separation) 6365/2010, pp. 131-139. Full text is available at: http://epubs.surrey.ac.uk/7723/
- (2010) 'Bimodal Coherence based Scale Ambiguity Cancellation for Target Speech Extraction and Enhancement'. ISCA-International Speech Communication Association Proceedings of 11th Annual Conference of the International Speech Communication Association 2010, Makuhari, Japan: 11th Annual Conference of the International Speech Communication Association 2010, pp. 438-441. Full text is available at: http://epubs.surrey.ac.uk/7725/
- (2010) 'Estimates of perceived spatial quality across the listening area'. Audio Engineering Society Proceedings of AES 38th International Conference, Piteå, Sweden: AES 38th International Conference: Sound Quality Evaluation, pp. 233-242. Full text is available at: http://epubs.surrey.ac.uk/7729/
- (2009) 'Speaker-dependent audio-visual emotion recognition'. Proc. Int. Conf. on Auditory-Visual Speech Processing (AVSP’08), Norwich, UK.
- (2009) 'A model of jet modulation in voiced fricatives'. Proc. Int. Conf. on Acoust. NAG-DAGA2009, Rotterdam, Netherlands, pp. 1733-1736.
- (2009) 'A HYBRID ITERATIVE ALGORITHM FOR NONNEGATIVE MATRIX FACTORIZATION'. IEEE 2009 IEEE/SP 15TH WORKSHOP ON STATISTICAL SIGNAL PROCESSING, VOLS 1 AND 2, Cardiff, WALES: 15th IEEE/SP Workshop on Statistical Signal Processing, pp. 409-412.
- (2009) 'Model-based synthesis of visual speech movements from 3D video'. Proceedings of ACM SIGGRAPH 2009: Posters, Louisiana, USA: SIGGRAPH '09.
- (2008) 'Coarticulatory constraints determined by automatic identification from articulograph data'. Strasbourg, France: Proc. 8th Int. Sem. on Spch. Prod. (ISSP’08), pp. 377-380.
- (2008) 'QESTRAL (Part 3): system and metrics for spatial quality prediction'. San Francisco CA: 125th Audio Engineering Society Convention.
- (2008) 'QESTRAL (Part 4): Test signals, combining metrics and the prediction of overall spatial quality'. San Francisco CA: 125th Audio Engineering Society Convention.
- (2008) 'An Unintrusive Objective Model for Predicting the Sensation of Envelopment Arising from Surround Sound Recordings'. Proc. 125th AES Conv., San Francisco CA. Full text is available at: http://epubs.surrey.ac.uk/7735/
- (2008) 'QESTRAL (Part 2): Calibrating the QESTRAL model using listening test data'. Proc. 125th AES Conv., San Francisco CA.
- (2008) 'QESTRAL (Part 1): Quality Evaluation of Spatial Transmission and Reproduction using an Artificial Listener'. Proc. 125th AES Conv., San Francisco CA.
- (2008) 'Frication and voicing classification'. Lecture Notes in Computer Science: Computational Processing of the Portuguese Language, Aveiro, Portugal: 8th International Conference, PROPOR 2008 5190, pp. 11-20. Full text is available at: http://epubs.surrey.ac.uk/7741/
- (2008) 'Audio-visual feature selection and reduction for emotion classification'. Proc. Int. Conf. on Auditory-Visual Speech Processing (AVSP’08), Tangalooma, Australia. Full text is available at: http://epubs.surrey.ac.uk/7738/
- (2008) 'Parameterisation of Speech Lip Movements'. Proceedings of International Conference on Auditory-visual Speech Processing, Tangalooma, Australia: AVSP.
- (2008) 'Parallel model combination and word recognition in soccer audio'. IEEE 2008 IEEE INTERNATIONAL CONFERENCE ON MULTIMEDIA AND EXPO, VOLS 1-4, Hannover, Germany: IEEE International Conference on Multimedia and Expo (ICME 2008), pp. 1465-1468. Full text is available at: http://epubs.surrey.ac.uk/7743/
- (2007) 'Statistical identification of critical, dependent and redundant articulators'. ISCA-INST SPEECH COMMUNICATION ASSOC INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, Antwerp, BELGIUM: Interspeech Conference 2007, pp. 2736-2739. Full text is available at: http://epubs.surrey.ac.uk/7745/
- (2007) 'Time-frequency-modulation representation of stochastic signals'. IEEE 2007 15th International Conference on Digital Signal Processing, DSP 2007, Cardiff: 15th International Conference on Digital Signal Processing, pp. 639-642. Full text is available at: http://epubs.surrey.ac.uk/7746/
- (2007) 'Visual Analysis of Lip Coarticulation in VCV Utterances'. ISCA-INST SPEECH COMMUNICATION ASSOC INTERSPEECH 2007: 8TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION, VOLS 1-4, Antwerp, BELGIUM: Interspeech Conference 2007, pp. 1281-1284. Full text is available at: http://epubs.surrey.ac.uk/7747/
- (2006) 'Representing Dynamics of Facial Expression'. IET European Conference on Visual Media Production, IET 3rd European Conference on Visual Media Production, pp. 183-183. Full text is available at: http://epubs.surrey.ac.uk/111040/
- (2006) 'Enhancement of harmonic content of speech based on a dynamic programming pitch tracking algorithm'. ISCA-INST SPEECH COMMUNICATION ASSOC INTERSPEECH 2006 AND 9TH INTERNATIONAL CONFERENCE ON SPOKEN LANGUAGE PROCESSING, VOLS 1-5, Pittsburgh, PA: 9th International Conference on Spoken Language Processing/INTERSPEECH 2006, pp. 81-84. Full text is available at: http://epubs.surrey.ac.uk/7749/
- (2005) 'Amplitude modulation of frication noise by voicing saturates'. Lisbon: Proc. Interspeech ’05, Lisbon, Portugal, pp. 4-4. Full text is available at: http://epubs.surrey.ac.uk/7750/
- (2005) 'Objective assessment of spatial localisation attributes of surround-sound reproduction systems'. Barcelona: Sound: 108th AES Convention.
- (2004) 'Speech Driven Face Synthesis from 3D Video'. IEEE IEEE Symposium on 3D Data Processing, Visualisation and Transmission, Thessaloniki, Greece: 2nd International Symposium on 3D Data Processing, Visualization and Transmission, pp. 58-65. Full text is available at: http://epubs.surrey.ac.uk/7754/
- (2003) 'Development of articulatory-based multi-level segmental HMMs for phonetic classification in ASR'. Faculty of Electrical Engineering and Computing, Zagreb, Croatia PROCEEDINGS EC-VIP-MC 2003, VOL 2, Zagreb, Croatia: 4th EURASIP Conference on Video, Image Processing and Multimedia Communications, pp. 655-660.
- (2000) 'Frication noise modulated by voicing, as revealed by pitch-scaled decomposition'. AMER INST PHYSICS JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA, BERLIN, GERMANY: 2nd International Conference on Voice Physiology and Biomechanics (ICVPB) 4 (108), pp. 1421-1434. doi: 10.1121/1.1289207. Full text is available at: http://epubs.surrey.ac.uk/7758/
- (2000) 'Performance of the pitch-scaled harmonic filter and applications in speech analysis'. IEEE 2000 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, PROCEEDINGS, VOLS I-VI, ISTANBUL, TURKEY: IEEE International Conference on Acoustics, Speech, and Signal Processing, pp. 1311-1314. Full text is available at: http://epubs.surrey.ac.uk/7759/
Book chapters
- (2010) 'Multimodal Emotion Recognition'. in Wang W (ed.) Machine Audition: Principles, Algorithms and Systems. IGI Global, Article number 17, pp. 398-423. Full text is available at: http://epubs.surrey.ac.uk/7730/
- (2005) 'Mama and papa: the ancestors of modern-day speech science'. in Smith CUM, Arnott R (eds.) The Genius of Erasmus Darwin. Aldershot, UK: Ashgate, pp. 217-236.
Theses and dissertations
- (2000) Characterisation of plosive, fricative and aspiration components in speech production. Full text is available at: http://epubs.surrey.ac.uk/7760/

