Procesamiento de Lenguaje Natural

Autores/as

Afiliación

INFOTEC Centro de Investigación e Innovación en Tecnologías de la Información y Comunicación

Prefacio

El curso trata de ser auto-contenido, es decir, no debería de ser necesario leer otras fuentes para poder entenderlo y realizar las actividades. De cualquier manera es importante comentar que el curso está basado en los siguientes libros de texto:

Speech and Language Processing. An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. Third Edition draft. Daniel Jurafsky and James H. Martin. pdf
Introduction to machine learning, Third Edition. Ethem Alpaydin. MIT Press.
An Introduction to Statistical Learning with Applications in R. Gareth James, Daniela Witten, Trevor Hastie, and Robert Tibshirani. Springer Texts in Statistics.
All of Statistics. A Concise Course in Statistical Inference. Larry Wasserman. MIT Press.
An Introduction to the Bootstrap. Bradley Efron and Robert J. Tibshirani. Monographs on Statistics and Applied Probability 57. Springer-Science+Business Media.
Understanding Machine Learning: From Theory to Algorithms. Shai Shalev-Shwartz and Shai Ben-David. Cambridge University Press.

Notación

La Tabla 1.1 muestra la notación que se seguirá en este documento.

Tabla 1.1: Notación

Símbolo	Significado
$x$	Variable usada comunmente como entrada
$y$	Variable usada comunmente como salida
$\mathbb R$	Números reales
$\mathbf x$	Vector Columna $\mathbf x \in \mathbb R^d$
$d$	Dimensión
$\mathbf w \cdot \mathbf x$	Producto punto donde $\mathbf w$ y $\mathbf x \in \mathbb R^d$
$\mathcal D$	Conjunto de datos
$\mathcal T$	Conjunto de entrenamiento
$\mathcal V$	Conjunto de validación
$\mathcal G$	Conjunto de prueba
$N$	Número de ejemplos
$K$	Número de clases
$\mathbb P(\cdot)$	Probabilidad
$\mathcal X, \mathcal Y$	Variables aleatorías
$\mathcal N(\mu, \sigma^2)$	Distribución Normal con parámetros $\mu$ y $\sigma^2$
$f_{\mathcal X}$	Función de densidad de probabilidad de $\mathcal X$
$\mathbb 1(e)$	Función para indicar; $1$ only if $e$ is true
$\Omega$	Espacio de búsqueda
$\mathbb V$	Varianza
$\mathbb E$	Esperanza

Licencia

Esta obra está bajo una Licencia Creative Commons Atribución-CompartirIgual 4.0 Internacional

# Prefacio {.unnumbered} El curso trata de ser auto-contenido, es decir, no debería de ser necesario leer otras fuentes para poder entenderlo y realizar las actividades. De cualquier manera es importante comentar que el curso está basado en los siguientes libros de texto: - Speech and Language Processing. An Introduction to Natural Language Processing, Computational Linguistics, and Speech Recognition. Third Edition draft. Daniel Jurafsky and James H. Martin. [pdf](https://web.stanford.edu/~jurafsky/slp3/ed3book_sep212021.pdf) - Introduction to machine learning, Third Edition. Ethem Alpaydin. MIT Press. - An Introduction to Statistical Learning with Applications in R. Gareth James, Daniela Witten, Trevor Hastie, and Robert Tibshirani. Springer Texts in Statistics. - All of Statistics. A Concise Course in Statistical Inference. Larry Wasserman. MIT Press. - An Introduction to the Bootstrap. Bradley Efron and Robert J. Tibshirani. Monographs on Statistics and Applied Probability 57. Springer-Science+Business Media. - Understanding Machine Learning: From Theory to Algorithms. Shai Shalev-Shwartz and Shai Ben-David. Cambridge University Press. # Notación La @tbl-notacion muestra la notación que se seguirá en este documento. |Símbolo | Significado | |------------------|----------------------------------------------------------| |$x$ | Variable usada comunmente como entrada | |$y$ | Variable usada comunmente como salida | |$\mathbb R$ | Números reales | |$\mathbf x$ | Vector Columna $\mathbf x \in \mathbb R^d$ | |$d$ | Dimensión | |$\mathbf w \cdot \mathbf x$ | Producto punto donde $\mathbf w$ y $\mathbf x \in \mathbb R^d$ | |$\mathcal D$ | Conjunto de datos | |$\mathcal T$ | Conjunto de entrenamiento | |$\mathcal V$ | Conjunto de validación | |$\mathcal G$ | Conjunto de prueba | |$N$ | Número de ejemplos | |$K$ | Número de clases | |$\mathbb P(\cdot)$ | Probabilidad | |$\mathcal X, \mathcal Y$ | Variables aleatorías | |$\mathcal N(\mu, \sigma^2)$ | Distribución Normal con parámetros $\mu$ y $\sigma^2$| |$f_{\mathcal X}$| Función de densidad de probabilidad de $\mathcal X$ | |$\mathbb 1(e)$ | Función para indicar; $1$ only if $e$ is true | |$\Omega$ | Espacio de búsqueda | |$\mathbb V$ | Varianza | |$\mathbb E$ | Esperanza | : Notación {#tbl-notacion} ## Licencia [![](https://i.creativecommons.org/l/by-sa/4.0/88x31.png)](http://creativecommons.org/licenses/by-sa/4.0/) Esta obra está bajo una [Licencia Creative Commons Atribución-CompartirIgual 4.0 Internacional](http://creativecommons.org/licenses/by-sa/4.0/)

Símbolo	Significado
\(x\)	Variable usada comunmente como entrada
\(y\)	Variable usada comunmente como salida
\(\mathbb R\)	Números reales
\(\mathbf x\)	Vector Columna \(\mathbf x \in \mathbb R^d\)
\(d\)	Dimensión
\(\mathbf w \cdot \mathbf x\)	Producto punto donde \(\mathbf w\) y \(\mathbf x \in \mathbb R^d\)
\(\mathcal D\)	Conjunto de datos
\(\mathcal T\)	Conjunto de entrenamiento
\(\mathcal V\)	Conjunto de validación
\(\mathcal G\)	Conjunto de prueba
\(N\)	Número de ejemplos
\(K\)	Número de clases
\(\mathbb P(\cdot)\)	Probabilidad
\(\mathcal X, \mathcal Y\)	Variables aleatorías
\(\mathcal N(\mu, \sigma^2)\)	Distribución Normal con parámetros \(\mu\) y \(\sigma^2\)
\(f_{\mathcal X}\)	Función de densidad de probabilidad de \(\mathcal X\)
\(\mathbb 1(e)\)	Función para indicar; \(1\) only if \(e\) is true
\(\Omega\)	Espacio de búsqueda
\(\mathbb V\)	Varianza
\(\mathbb E\)	Esperanza