dor_id: 4110119

506.#.#.a: Público

590.#.#.d: Los artículos enviados a la revista "Journal of Applied Research and Technology", se juzgan por medio de un proceso de revisión por pares

510.0.#.a: Scopus, Directory of Open Access Journals (DOAJ); Sistema Regional de Información en Línea para Revistas Científicas de América Latina, el Caribe, España y Portugal (Latindex); Indice de Revistas Latinoamericanas en Ciencias (Periódica); La Red de Revistas Científicas de América Latina y el Caribe, España y Portugal (Redalyc); Consejo Nacional de Ciencia y Tecnología (CONACyT); Google Scholar Citation

561.#.#.u: https://www.icat.unam.mx/

650.#.4.x: Ingenierías

336.#.#.b: article

336.#.#.3: Artículo de Investigación

336.#.#.a: Artículo

351.#.#.6: https://jart.icat.unam.mx/index.php/jart

351.#.#.b: Journal of Applied Research and Technology

351.#.#.a: Artículos

harvesting_group: RevistasUNAM

270.1.#.p: Revistas UNAM. Dirección General de Publicaciones y Fomento Editorial, UNAM en revistas@unam.mx

590.#.#.c: Open Journal Systems (OJS)

270.#.#.d: MX

270.1.#.d: México

590.#.#.b: Concentrador

883.#.#.u: https://revistas.unam.mx/catalogo/

883.#.#.a: Revistas UNAM

590.#.#.a: Coordinación de Difusión Cultural

883.#.#.1: https://www.publicaciones.unam.mx/

883.#.#.q: Dirección General de Publicaciones y Fomento Editorial

850.#.#.a: Universidad Nacional Autónoma de México

856.4.0.u: https://jart.icat.unam.mx/index.php/jart/article/view/658/640

100.1.#.a: Mena, Carlos Daniel Hernández; Ruiz, Ivan V. Meza; Camacho, José Abel Herrera

524.#.#.a: Mena, Carlos Daniel Hernández, et al. (2017). Automatic speech recognizers for Mexican Spanish and its open resources. Journal of Applied Research and Technology; Vol. 15 Núm. 3. Recuperado de https://repositorio.unam.mx/contenidos/4110119

245.1.0.a: Automatic speech recognizers for Mexican Spanish and its open resources

502.#.#.c: Universidad Nacional Autónoma de México

561.1.#.a: Instituto de Ciencias Aplicadas y Tecnología, UNAM

264.#.0.c: 2017

264.#.1.c: 2019-06-07

653.#.#.a: Automatic speech recognition; Mexican Spanish; Language resources; Language model; Acoustic model

506.1.#.a: La titularidad de los derechos patrimoniales de esta obra pertenece a las instituciones editoras. Su uso se rige por una licencia Creative Commons BY-NC-SA 4.0 Internacional, https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode.es, para un uso diferente consultar al responsable jurídico del repositorio por medio del correo electrónico gabriel.ascanio@icat.unam.mx

884.#.#.k: https://jart.icat.unam.mx/index.php/jart/article/view/658

001.#.#.#: 074.oai:ojs2.localhost:article/658

041.#.7.h: eng

520.3.#.a: Development of automatic speech recognition systems relies on the availability of distinct language resources such as speech recordings, pronunciation dictionaries, and language models. These resources are scarce for the Mexican Spanish dialect. In this work, we present a revision of the CIEMPIESS corpus that is a resource for spontaneous speech recognition in Mexican Spanish of Central Mexico. It consists of 17 h of segmented and transcribed recordings, a phonetic dictionary composed by 53,169 unique words, and a language model composed by 1,505,491 words extracted from 2489 university newsletters. We also evaluate the CIEMPIESS corpus using three well known state of the art speech recognition engines, having satisfactory results. These resources are open for research and development in the field. Additionally, we present the methodology and the tools used to facilitate the creation of these resources which can be easily adapted to other variants of Spanish, or even other languages.

773.1.#.t: Journal of Applied Research and Technology; Vol. 15 Núm. 3

773.1.#.o: https://jart.icat.unam.mx/index.php/jart

022.#.#.a: ISSN electrónico: 2448-6736; ISSN: 1665-6423

310.#.#.a: Bimestral

264.#.1.b: Instituto de Ciencias Aplicadas y Tecnología, UNAM

doi: https://doi.org/10.1016/j.jart.2017.02.001

harvesting_date: 2023-11-08 13:10:00.0

856.#.0.q: application/pdf

file_creation_date: 2017-06-19 21:48:46.0

file_modification_date: 2017-06-19 17:19:59.0

file_creator: Carlos Daniel Hernández-Mena

file_name: 0b3d9a0e80278703119c0a11f619274d021f9e4cac6520d3d723f33ae4694805.pdf

file_pages_number: 12

file_format_version: application/pdf; version=1.7

file_size: 1483102

last_modified: 2024-03-19 14:00:00

license_url: https://creativecommons.org/licenses/by-nc-sa/4.0/legalcode.es

license_type: by-nc-sa

No entro en nada

No entro en nada 2

Artículo

Automatic speech recognizers for Mexican Spanish and its open resources

Mena, Carlos Daniel Hernández; Ruiz, Ivan V. Meza; Camacho, José Abel Herrera

Instituto de Ciencias Aplicadas y Tecnología, UNAM, publicado en Journal of Applied Research and Technology, y cosechado de Revistas UNAM

Licencia de uso

Procedencia del contenido

Cita

Mena, Carlos Daniel Hernández, et al. (2017). Automatic speech recognizers for Mexican Spanish and its open resources. Journal of Applied Research and Technology; Vol. 15 Núm. 3. Recuperado de https://repositorio.unam.mx/contenidos/4110119

Descripción del recurso

Autor(es)
Mena, Carlos Daniel Hernández; Ruiz, Ivan V. Meza; Camacho, José Abel Herrera
Tipo
Artículo de Investigación
Área del conocimiento
Ingenierías
Título
Automatic speech recognizers for Mexican Spanish and its open resources
Fecha
2019-06-07
Resumen
Development of automatic speech recognition systems relies on the availability of distinct language resources such as speech recordings, pronunciation dictionaries, and language models. These resources are scarce for the Mexican Spanish dialect. In this work, we present a revision of the CIEMPIESS corpus that is a resource for spontaneous speech recognition in Mexican Spanish of Central Mexico. It consists of 17 h of segmented and transcribed recordings, a phonetic dictionary composed by 53,169 unique words, and a language model composed by 1,505,491 words extracted from 2489 university newsletters. We also evaluate the CIEMPIESS corpus using three well known state of the art speech recognition engines, having satisfactory results. These resources are open for research and development in the field. Additionally, we present the methodology and the tools used to facilitate the creation of these resources which can be easily adapted to other variants of Spanish, or even other languages.
Tema
Automatic speech recognition; Mexican Spanish; Language resources; Language model; Acoustic model
Idioma
eng
ISSN
ISSN electrónico: 2448-6736; ISSN: 1665-6423

Enlaces