Skip to content
Change the repository type filter

All

    Repositories list

    • velox

      Public
      A C++ vectorized database acceleration library aimed to optimizing query engines and data processing systems.
      C++
      Apache License 2.0
      1.2k003Updated Nov 25, 2024Nov 25, 2024
    • Stores benchmark data (and other resources)
      0000Updated Oct 28, 2024Oct 28, 2024
    • seatunnel

      Public
      SeaTunnel is a next-generation super high-performance, distributed, massive data integration tool.
      Java
      Apache License 2.0
      1.8k000Updated Oct 21, 2024Oct 21, 2024
    • DataX

      Public
      DataX是阿里云DataWorks数据集成的开源版本。
      Java
      Other
      5.5k000Updated Mar 8, 2024Mar 8, 2024
    • zeppelin

      Public
      Web-based notebook that enables data-driven, interactive data analytics and collaborative documents with SQL, Scala and more.
      Java
      Apache License 2.0
      2.8k000Updated Jan 16, 2024Jan 16, 2024
    • Apache DolphinScheduler is the modern data orchestration platform. Agile to create high performance workflow with low-code
      Java
      Apache License 2.0
      4.6k000Updated Jan 10, 2024Jan 10, 2024
    • arrow

      Public
      Apache Arrow is a multi-language toolbox for accelerated data interchange and in-memory processing
      C++
      Apache License 2.0
      3.6k000Updated Nov 24, 2023Nov 24, 2023
    • spark

      Public
      Apache Spark - A unified analytics engine for large-scale data processing
      Scala
      Apache License 2.0
      28k000Updated Jul 18, 2023Jul 18, 2023