Ingestion Step

How to develop a step without deploying infrastructure

After installing the requirements, install these additional dependencies:

pip install pytest numpy Cython pytest-docker psycopg2

Each time you run the integration tests, a process creates Kafka, Zookeeper, MongoDB, and PostgreSQL instances for your use. After the tests finish, the infrastructure shuts down.

Then run the integration tests:

Test only the multi driver:

python -m pytest -x -s tests/integration/test_multi_driver.py

Test only the step:

python -m pytest -x -s tests/integration/test_step.py
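
The throwaway infrastructure described above takes a moment to accept connections, so integration tests usually wait for each service before they start. The following is a minimal sketch of such a readiness check using only the standard library. The helper name wait_for_port is hypothetical and is not taken from this repository.

```python
import socket
import time


def wait_for_port(host, port, timeout=30.0, pause=0.1):
    """Return True once a TCP connection to host:port succeeds, False on timeout."""
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        try:
            # A successful connect means something is listening on the port.
            with socket.create_connection((host, port), timeout=1.0):
                return True
        except OSError:
            # Not reachable yet: pause briefly and try again.
            time.sleep(pause)
    return False
```

For example, wait_for_port("localhost", 9092) would block until a Kafka broker listening on that address accepts connections, or return False after 30 seconds.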

Description

Inserts data from any survey, as parsed and assigned by the Sorting Hat.

This step performs:

Calculate object statistics
Light curve correction
Previous candidates processing
Insert objects
Insert detections
Insert non-detections
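
As an illustration of the first item in the list above, here is a minimal sketch of per-object statistics computed from a list of detections. The input keys (ra, dec, mjd) and the output names (ndet, meanra, meandec, firstmjd, lastmjd) are assumptions made for this example; the step's actual fields and formulas are not described in this README.

```python
from statistics import fmean


def object_statistics(detections):
    """Summarize one object's detections (assumed keys: ra, dec, mjd)."""
    if not detections:
        raise ValueError("at least one detection is required")
    mjds = [d["mjd"] for d in detections]
    return {
        "ndet": len(detections),
        # Plain arithmetic means; this ignores the RA wrap-around at 0/360 degrees.
        "meanra": fmean(d["ra"] for d in detections),
        "meandec": fmean(d["dec"] for d in detections),
        "firstmjd": min(mjds),
        "lastmjd": max(mjds),
    }
```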

Previous steps:

Sorting Hat

Next steps:

None

Database interactions

Select:

Query to get the light curve (detections and non-detections).
Query if object exists in database.

Insert:

New detection.
New non-detection(s).
New objects.
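
The select-then-insert pattern listed above (check whether an object exists, insert it only if it is new) can be sketched as follows. This is an illustration only: it uses an in-memory SQLite database from the standard library as a stand-in for PostgreSQL, and the table name object, the column oid, and the identifier ZTF00example are hypothetical.

```python
import sqlite3


def insert_object_if_new(conn, oid):
    """Insert oid into the (hypothetical) object table unless it already exists."""
    exists = conn.execute("SELECT 1 FROM object WHERE oid = ?", (oid,)).fetchone()
    if exists:
        return False
    conn.execute("INSERT INTO object (oid) VALUES (?)", (oid,))
    conn.commit()
    return True


# Stand-in database; the real step connects to PostgreSQL.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE object (oid TEXT PRIMARY KEY)")

first = insert_object_if_new(conn, "ZTF00example")   # object is new
second = insert_object_if_new(conn, "ZTF00example")  # object already stored
```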

Previous conditions

No special conditions; only a connection to Kafka and the database is required.

Version

1.0.1 https://github.com/alercebroker/ingestion_step/releases/tag/1.0.0-rc3

Libraries used

Environment variables

DB setup

DB_HOST: Database host for connection.
DB_USER: Database user with read/write access (both permissions are required).
DB_PASSWORD: Password for the database user.
DB_PORT: Port for the database connection.
DATABASE: Name of the database.
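
To show how these five variables fit together, here is a minimal sketch that assembles them into a PostgreSQL connection URL. Whether the step builds a URL or passes the fields separately is not stated in this README, so the function database_url is a hypothetical helper.

```python
from urllib.parse import quote


def database_url(env):
    """Build a PostgreSQL connection URL from the DB_* variables."""
    # Percent-encode credentials so characters such as "@" or ":" stay unambiguous.
    user = quote(env["DB_USER"], safe="")
    password = quote(env["DB_PASSWORD"], safe="")
    host = env["DB_HOST"]
    port = int(env["DB_PORT"])  # fails early if the port is not numeric
    return f"postgresql://{user}:{password}@{host}:{port}/{env['DATABASE']}"
```

In practice the mapping passed in would be os.environ; a missing variable raises KeyError, which surfaces configuration mistakes at startup.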

Consumer setup

CONSUMER_TOPICS: Topics to consume, as a comma-separated string. e.g: topic_one or topic_two,topic_three
CONSUMER_SERVER: Kafka host with port. e.g: localhost:9092
CONSUMER_GROUP_ID: Name for consumer group. e.g: ingestion-step
CONSUME_TIMEOUT: Max seconds to wait for a message. e.g: 60
CONSUME_MESSAGES: Number of messages to consume in each operation. e.g: 500
TOPIC_STRATEGY_FORMAT (optional): Format string for topics whose names change every day. e.g: ztf_{}_programid1
CONSUMER_TOPICS (optional): Topic list to consume. e.g: ztf_*

You must set at least one of TOPIC_STRATEGY_FORMAT or CONSUMER_TOPICS.
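
The rule above can be sketched as a small resolver. Two details are assumptions made for this example and are not taken from the step's code: CONSUMER_TOPICS takes precedence when both variables are set, and the {} placeholder is filled with the date formatted as YYYYMMDD.

```python
from datetime import date


def consumer_topics(env, today):
    """Resolve the topic list from CONSUMER_TOPICS or TOPIC_STRATEGY_FORMAT."""
    raw = env.get("CONSUMER_TOPICS", "")
    topics = [t.strip() for t in raw.split(",") if t.strip()]
    if topics:
        return topics
    fmt = env.get("TOPIC_STRATEGY_FORMAT")
    if fmt:
        # Assumption: the {} placeholder takes the date as YYYYMMDD.
        return [fmt.format(today.strftime("%Y%m%d"))]
    raise ValueError("set TOPIC_STRATEGY_FORMAT or CONSUMER_TOPICS")
```

Calling consumer_topics(os.environ, date.today()) at startup would fail fast when neither variable is configured.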

Step metadata

STEP_VERSION: Current version of the step. e.g: 1.0.0
STEP_ID: Unique identifier for the step. e.g: S3
STEP_NAME: Name of the step. e.g: S3
STEP_COMMENTS: Comments of the specific version.

Stream

This step requires a consumer.

Input schema

Generic Alert

Output schema

Schema

Build docker image

To use this step, first build the Docker image. After that you can run the step.

docker build -t ingestion_step:version .

Run step

Run the Docker container

You can use a docker run command; you must set all the environment variables.

docker run --name ingestion_step -e DB_HOST=myhost -e [... all env ...] -d ingestion_step:version

Run docker-compose

You can also edit the environment variables in the docker-compose.yml file and then use the docker-compose up command. This runs only one container.

docker-compose up -d

If you want to scale this container, set the number of containers to run (n).

docker-compose up -d --scale ingestion_step=n

Note: Use docker-compose down to stop all containers.

Run the released image

For each release, an image is uploaded to ghcr.io that you can use instead of building your own. To do that, replace the image in docker-compose.yml or in the docker run command with this one:

docker pull ghcr.io/alercebroker/ingestion_step:latest

Local Installation

Requirements

To install the required packages, run:

pip install -r requirements.txt

After that you can modify the logic of the step in step.py and run:

python scripts/run_step.py
