PyArrow

PyArrow is the Python API for Apache Arrow, enabling high-performance, cross-language data interchange and in-memory analytics via a standardized columnar format.

PyArrow provides the Python bindings for the Apache Arrow project, which defines a language-independent columnar memory format. Because every Arrow implementation uses the same in-memory layout, data can be shared between Python, C++, Java, and other runtimes without being converted between formats, and in many cases without being copied at all. PyArrow integrates closely with the Python data ecosystem, converting to and from pandas DataFrames and NumPy arrays. It also includes fast readers and writers for common columnar file formats such as Parquet and Feather, which makes it a practical choice for data engineers and scientists working with large datasets and multi-language data pipelines.

https://arrow.apache.org/docs/python/index.html
