Undertaking this project as one of my first explorations into machine learning was an invaluable experience. Building a neural network from the ground up let me grasp the underlying concepts with far greater depth, and it gave me a real appreciation for the nuances involved in neural network development.
Since my initial implementation, I have fully refactored the code and created an in-depth project notebook on the subject! Check out the instructions below to run it.
You can download the prebuilt image from my Docker Hub repository.

- Build the image yourself with `docker build -t <image-name> .`, or download the image linked above.
- Run `docker run -p 8888:8888 <image-name>`.
- Lastly, follow the link http://127.0.0.1:8888?token=docker.
To run the notebook locally, you will need to install the project requirements:

- Create a Python virtual environment with `python3 -m venv <environment-name>` and activate it with `source <environment-name>/bin/activate`.
- Install the requirements with `pip install -r requirements.txt`.
- Open and run the notebook at `src/notebooks/skip_gram.ipynb`.
If you are interested, there is also a `unit_test` file along with relevant utils that you can adapt to test different parts of the code.
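If you want to poke at individual pieces, here is a minimal sketch of the kind of test you could adapt; the `softmax` helper below is a hypothetical placeholder, not the actual util shipped in the repo.

```python
import numpy as np

def softmax(x):
    """Placeholder util; swap in the corresponding function from the repo's utils."""
    e = np.exp(x - np.max(x))
    return e / e.sum()

def test_softmax_sums_to_one():
    # Basic sanity checks: the output is a valid probability distribution.
    probs = softmax(np.array([1.0, 2.0, 3.0]))
    assert np.isclose(probs.sum(), 1.0)
    assert np.all(probs > 0)
```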
The first set of results is the TensorFlow Projector view of the final embeddings. Use it to explore the embeddings interactively and try out different dimensionality reduction techniques!
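If you would rather export your own embeddings into the Projector, the sketch below writes the two tab-separated files it expects; the `vocab` and `embeddings` names are placeholders I made up, not the notebook's actual variables.

```python
import numpy as np

# Hypothetical inputs: adapt the names to the actual variables in the notebook.
vocab = ["treasure", "island", "ship", "map"]   # one word per embedding row
embeddings = np.random.rand(len(vocab), 16)     # shape (vocab_size, embedding_dim)

# projector.tensorflow.org loads a tab-separated vectors file plus a metadata file.
np.savetxt("vectors.tsv", embeddings, delimiter="\t")
with open("metadata.tsv", "w") as f:
    f.write("\n".join(vocab) + "\n")
```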
To verify the results, I tracked the word vectors during training and observed the predicted probabilities converging. The implementation relies on a straightforward Python and NumPy approach, which inherently limits its scalability, but it performs well at smaller scales, fulfilling its intended purpose and keeping the project interactive and engaging.
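For a quick sense of what that Python-and-NumPy approach looks like, here is a minimal sketch of a single skip-gram update with a full softmax; the names and the exact loss setup are illustrative assumptions rather than a copy of the notebook's code.

```python
import numpy as np

def skip_gram_step(center_idx, context_idx, W_in, W_out, lr=0.05):
    """One gradient step for a (center, context) pair using a full softmax.

    W_in, W_out: (vocab_size, dim) input/output embedding matrices, updated in place.
    Returns the cross-entropy loss for this pair.
    """
    v = W_in[center_idx]                      # center word vector, shape (dim,)
    scores = W_out @ v                        # one score per vocabulary word
    scores -= scores.max()                    # numerical stability for the softmax
    probs = np.exp(scores) / np.exp(scores).sum()

    loss = -np.log(probs[context_idx])

    # Gradient of the loss w.r.t. the scores is (probs - one_hot(context)).
    d_scores = probs.copy()
    d_scores[context_idx] -= 1.0

    grad_W_out = np.outer(d_scores, v)        # (vocab_size, dim)
    grad_v = W_out.T @ d_scores               # (dim,)

    W_out -= lr * grad_W_out
    W_in[center_idx] -= lr * grad_v
    return loss
```

Looping this over all (center, context) pairs produced by a sliding window over the corpus gives the basic training loop.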
The animation below visualizes the convergence of the word vectors based on the word distributions in the toy corpus. As training progresses, various semantic relationships start to emerge; this is much clearer in large-scale implementations of the Skip-Gram model that have far more text to work with.
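The animation itself comes from the notebook, but as a rough idea of how such frames can be produced, the sketch below projects saved embedding snapshots to 2D with a NumPy-only PCA and writes one image per epoch; `snapshots` and `words` are hypothetical stand-ins for whatever the notebook records during training.

```python
import numpy as np
import matplotlib.pyplot as plt

def pca_2d(X):
    """Project the rows of X onto their first two principal components."""
    X = X - X.mean(axis=0)
    _, _, Vt = np.linalg.svd(X, full_matrices=False)
    return X @ Vt[:2].T

# Hypothetical: one (vocab_size, dim) embedding matrix saved per epoch.
snapshots = [np.random.rand(10, 16) for _ in range(5)]
words = [f"word_{i}" for i in range(10)]

for epoch, W in enumerate(snapshots):
    coords = pca_2d(W)
    plt.figure(figsize=(4, 4))
    plt.scatter(coords[:, 0], coords[:, 1])
    for (x, y), w in zip(coords, words):
        plt.annotate(w, (x, y))
    plt.title(f"Epoch {epoch}")
    plt.savefig(f"frame_{epoch:03d}.png")   # stitch the frames into an animation afterwards
    plt.close()
```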
The animation below demonstrates the model converging to the ground-truth distribution of words appearing in the context of 'treasure' within the dataset. The probabilities are sorted to make the comparison easier.
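Conceptually, that comparison amounts to putting the model's predicted context distribution next to the empirical one from the corpus, sorted in the same way. The sketch below shows the idea with assumed names (`W_in`, `W_out`, `word_to_idx`, `cooc_counts`); the notebook's actual variables may differ.

```python
import numpy as np

# Assumed names: W_in/W_out are the trained matrices, word_to_idx maps words to rows,
# and cooc_counts["treasure"] is a length-vocab_size array of context co-occurrence counts.
def predicted_context_probs(word, W_in, W_out, word_to_idx):
    """Softmax over the output scores for a given center word."""
    scores = W_out @ W_in[word_to_idx[word]]
    scores -= scores.max()                  # numerical stability
    p = np.exp(scores)
    return p / p.sum()

def empirical_context_probs(word, cooc_counts):
    """Normalized co-occurrence counts for the same center word."""
    counts = np.asarray(cooc_counts[word], dtype=float)
    return counts / counts.sum()

# Sorting both distributions in descending order before plotting gives the
# comparison shown in the animation, e.g.:
#   np.sort(predicted_context_probs("treasure", W_in, W_out, word_to_idx))[::-1]
#   np.sort(empirical_context_probs("treasure", cooc_counts))[::-1]
```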