This project focuses on enhancing data security in data analysis processes. It achieves this by:
- Utilizing a Named Entity Recognition (NER) model to pixelate sensitive information in dataframes, thereby ensuring data privacy and security.
- Large Language Model (LLM) to provide robust and efficient data processing capabilities. This combination of NER and LLM allows for secure, efficient, and accurate data analysis.
- Offering a significant improvement in data analysis coding speed for the traditional finance industry. With the power of LLM, financial analysts can code faster without worrying about data leakage.
The project includes:
- A FastAPI server for handling requests.
- A Jupyter notebook server to execute the code from front-end.
- A front-end web application for code editing and interaction.
The project can be run in a development environment or deployed using Docker for scalability and ease of use. It is an excellent choice for anyone looking to perform secure data analysis while maintaining the privacy and security of the underlying data.
cd fastapi
chmod +x start_servers.sh
./install_venv.sh
cd ../jupyter_react && npm install
AZURE_OPENAI_API_KEY = replace_with_your_api
chmod +x start_servers.sh
./start_servers.sh
This guide will help you build and start the Docker services defined in the docker-compose.yml
file.
- Docker
- Docker Compose
The docker-compose.yml
file defines the following services:
jupyter
: A Jupyter notebook server. Accessible athttp://localhost:8888
.python
: A Python with anaconda packages.julia
: A Julia kernel.rstudio
: An RStudio kernel.
fastapi-middleware
: A middle contain NER model and dataframe process. Accessible athttp://localhost:8000
.jupyter-react
: A front end web for code edit. Accessible athttp://localhost:3000
.
Before starting the services, you need to build the Docker images. Navigate to the directory containing the docker-compose.yml
file and run:
docker-compose build
Alternative
docker-compose build --parallel
docker-compose up
docker-compose down