UGSpeechData - Audio speech dataset of 5 Ghanaian languages - Akan, Ewe, Dagbani, Dagaare, and Ikposo

The dataset comprises a 5000-hour speech corpus in Akan, Ewe, Dagbani, Dagaare, and Ikposo. Each language includes 1000 hours of audio speech from indigenous speakers of the language and 100 hours of transcription.

Link(s) to Data Assets

Local Audios + AUDIO_ID.csv

AUDIO_ID.csv Description

Column	Description
`IMAGE_URL`	Provides the relative path to the images in the folder
`IMAGE_SRC_URL`	Provides the source path to the actual image online
`AUDIO_URL`	Provides the relative path to the local-language audio file in the Local Audio folder
`ORG_NAME`	Identifies the institution coordinating the audio collection
`PROJECT_NAME`	Provides the name of the project
`SPEAKER_ID`	Provides the ID number of the individual describing the image
`LOCALE`	Provides the IETF BCP 47 language tag of the local language spoken in the audio file
`GENDER`	Provides the gender of the individual providing the audio description
`AGE`	Provides the age of the individual providing the audio description
`DEVICE`	Identifies the device used to make the audio recording
`ENVIRONMENT`	Identifies the space within which the audio was recorded
`YEAR`	The year in which the audio was recorded
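
The column layout above can be checked when the metadata is loaded. The following is a minimal sketch, not an official loader: it assumes the file is comma-delimited, UTF-8 encoded, and has a header row whose names match the columns listed above exactly. The function names `parse_metadata`, `load_metadata`, and `rows_for_locale` are hypothetical helpers introduced for illustration.

```python
import csv

# Column names copied from the AUDIO_ID.csv description above.
EXPECTED_COLUMNS = [
    "IMAGE_URL", "IMAGE_SRC_URL", "AUDIO_URL", "ORG_NAME", "PROJECT_NAME",
    "SPEAKER_ID", "LOCALE", "GENDER", "AGE", "DEVICE", "ENVIRONMENT", "YEAR",
]


def parse_metadata(lines):
    """Parse CSV text lines (header first) into one dict per recording.

    Assumption: comma-delimited with a header row; adjust the delimiter
    if the actual file differs.
    """
    reader = csv.DictReader(lines)
    header = reader.fieldnames or []
    missing = [name for name in EXPECTED_COLUMNS if name not in header]
    if missing:
        raise ValueError(f"missing columns: {missing}")
    return list(reader)


def load_metadata(csv_path):
    """Open the metadata file (assumed UTF-8) and parse it."""
    with open(csv_path, newline="", encoding="utf-8") as handle:
        return parse_metadata(handle)


def rows_for_locale(rows, locale):
    """Keep only the recordings whose LOCALE equals the given locale ID."""
    return [row for row in rows if row["LOCALE"] == locale]
```

For example, `rows_for_locale(load_metadata("AUDIO_ID.csv"), "ee_gh")` would return the rows tagged with the `ee_gh` locale ID, assuming the file sits in the working directory under that name.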

Note: Locale IDs

Locale ID	Name
`ak_gh`	Akan
`dga_gh`	Dagbani
`dag_gh`	Dagaare
`ee_gh`	Ewe
`kpo_gh`	Ikposo
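
The locale IDs in the table above can be used to summarise metadata rows by language. This is a minimal sketch under stated assumptions: the mapping is copied verbatim from the table, each row is a dict with a `LOCALE` key, and values are normalised by trimming whitespace and lowercasing (an assumption about how the IDs appear in the file). The names `LOCALE_NAMES` and `count_by_language` are hypothetical.

```python
from collections import Counter

# Locale ID to language name, copied verbatim from the table above.
LOCALE_NAMES = {
    "ak_gh": "Akan",
    "dga_gh": "Dagbani",
    "dag_gh": "Dagaare",
    "ee_gh": "Ewe",
    "kpo_gh": "Ikposo",
}


def count_by_language(rows):
    """Count recordings per language name from rows with a LOCALE field."""
    counts = Counter()
    for row in rows:
        # Assumption: IDs may carry stray whitespace or mixed case.
        locale = row["LOCALE"].strip().lower()
        # Unrecognised IDs are kept visible rather than silently dropped.
        counts[LOCALE_NAMES.get(locale, f"unknown ({locale})")] += 1
    return dict(counts)
```

Keeping unknown IDs in the result makes it easier to spot rows whose `LOCALE` value does not match any entry in the table.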

CITATION

Wiafe, I., Abdulai, J., Ekpezu, A. O., Dodzi, R., Atsakpo, E. D., Nutrokpor, C., Winful, F. B. P., & Solaga, K. K. (2023). UGSPEECHDATA (Version 1.0.0) [Data set]. https://github.com/isaacwiafe/speech_data_ug
