Skip to content

Commit

Permalink
apacheGH-44474: [Website][Docs] Improve project description in more p…
Browse files Browse the repository at this point in the history
…laces (apache#44522)

This is a follow-up to apache/arrow-site#549 and
apache#44492. This updates the project
description in a few other places where it appears prominently in the
website and docs.
* GitHub Issue: apache#44474
  • Loading branch information
ianmcook authored Oct 24, 2024
1 parent 2bbd67d commit 7ef5437
Show file tree
Hide file tree
Showing 3 changed files with 13 additions and 12 deletions.
5 changes: 3 additions & 2 deletions README.md
Original file line number Diff line number Diff line change
Expand Up @@ -25,8 +25,9 @@

## Powering In-Memory Analytics

Apache Arrow is a development platform for in-memory analytics. It contains a
set of technologies that enable big data systems to process and move data fast.
Apache Arrow is a universal columnar format and multi-language toolbox for fast
data interchange and in-memory analytics. It contains a set of technologies that
enable data systems to efficiently store, process, and move data.

Major components of the project include:

Expand Down
15 changes: 7 additions & 8 deletions docs/source/index.rst
Original file line number Diff line number Diff line change
Expand Up @@ -20,15 +20,14 @@
Apache Arrow
============

Apache Arrow is a development platform for in-memory analytics. It contains a
set of technologies that enable big data systems to process and move data
fast. It specifies a standardized language-independent columnar memory format
for flat and hierarchical data, organized for efficient analytic operations on
modern hardware.
Apache Arrow is a universal columnar format and multi-language toolbox for fast
data interchange and in-memory analytics.

The project is developing a multi-language collection of libraries for solving
systems problems related to in-memory analytical data processing. This includes
such topics as:
The project specifies a language-independent column-oriented memory format
for flat and hierarchical data, organized for efficient analytic operations on
modern hardware. The project houses an actively developed collection of
libraries in many languages for solving problems related to data transfer and
in-memory analytical processing. This includes such topics as:

* Zero-copy shared memory and RPC-based data movement
* Reading and writing file formats (like CSV, Apache ORC, and Apache Parquet)
Expand Down
5 changes: 3 additions & 2 deletions docs/source/python/index.rst
Original file line number Diff line number Diff line change
Expand Up @@ -25,8 +25,9 @@ PyArrow - Apache Arrow Python bindings

This is the documentation of the Python API of Apache Arrow.

Apache Arrow is a development platform for in-memory analytics.
It contains a set of technologies that enable big data systems to store, process and move data fast.
Apache Arrow is a universal columnar format and multi-language toolbox for fast
data interchange and in-memory analytics. It contains a set of technologies that
enable data systems to efficiently store, process, and move data.

See the :doc:`parent documentation <../index>` for additional details on
the Arrow Project itself, on the Arrow format and the other language bindings.
Expand Down

0 comments on commit 7ef5437

Please sign in to comment.