mahdiqb/README.md

Hi there 👋

I'm Mahdi, a PM building products in the data space. Before transitioning to product, I spent seven years designing and building petabyte-scale data platforms while wearing different hats (data engineer, tech lead, data architect, and ML Ops engineer). I'm passionate about open-source projects and enjoy working with data and designing scalable solutions. You can also read my content on Medium and via the Data Espresso newsletter.

The technologies I'm most familiar with:

  • Apache Spark: I used it on a daily basis for nearly four years (so we know each other pretty well).
  • dbt: It's the tool I'm working with the most currently. I'm mainly working on defining and implementing standards, frameworks, and automation to better leverage dbt at scale. (Article from the Zendesk Engineering blog)
  • AWS Ecosystem: Worked with it for two years on various data and ML projects (mostly with Glue, EMR, Athena, ECS, SageMaker, and the AWS CI/CD stack).
  • GCP Ecosystem: Using it currently on a daily basis (mostly working with BigQuery and GKE).
  • Hadoop: Worked with Hadoop data lakes for two and a half years (it was the ecosystem that first introduced me to distributed systems and the paradigms/concepts behind them).
  • Other notable projects/tools: Apache Superset, Apache Airflow, Apache Zeppelin, Apache Hive, Dremio, Databricks, Jupyter, and D3.js.
  • Languages I'm fluent in: Python, Java, and SQL.
  • Other languages I used in the past: C++, C#, JavaScript (Angular, Node.js), and HTML+CSS.
  • IaC: Terraform and CloudFormation.

Notable published work:

Notable presentations and podcasts:

Pinned

  1. modern_data_platform (Public)

    Sample configuration to deploy a modern data platform.

    Shell 86 20

  2. dynamic_dashboards_generator (Public)

    A POC for an application that leverages notebooks to generate dynamic dashboards.

    Jupyter Notebook 5 1

  3. dataforgoodfr/batch8_worldbank (Public)

    Jupyter Notebook 9 17

  4. mahdiqb.github.io (Public)

    Forked from jarrekk/Jalpc

    Mahdi Karabiben's website

    CSS 1