Oral cancer speech corpus for the paper "Objective speech outcomes after surgical treatment for oral cancer: An acoustic analysis of a spontaneous speech corpus containing 32.850 tokens"

Main Authors: Thomas B. Tienkamp, Rob J. J. H. van Son, Bence Mark Halpern
Format: info dataset Journal
Bahasa: eng
Terbitan: , 2022
Subjects:
Online Access: https://zenodo.org/record/6401713
Daftar Isi:
  • Dataset accompanying the paper "Objective speech outcomes after surgical treatment for oral cancer: An acoustic analysis of a spontaneous speech corpus containing 32.850 tokens" The zip file contains five folders: - Database: contains csv files for each speaker which contain the processed features - Recordings: the original recording from the YouTube Oral Cancer speech dataset, without further preprocessing - Recordings_Normalised: same as recordings but after minimal audio preprocessing (min-max scaling) - Textgrids: contains the textgrids which are annotated on the word-level and on phoneme-level - TIMIT selection: contains the textgrids for the TIMIT speakers. We unfortunately cannot share the audio date as it is not open source. More information can be found here.