Parquet

tap-parquet (dataops-tk variant)

Parquet is a columnar storage format for Hadoop.

Apache Parquet is an open-source columnar file format, rather than a standalone software tool, that originated in the Hadoop ecosystem and is designed for efficient processing of large datasets. It works with a variety of data processing frameworks, including Apache Spark, Apache Hive, and Apache Impala, and supports a wide range of data types and compression algorithms. Because values are stored column by column, analytical queries can read only the columns they need. This makes Parquet particularly useful for data analytics and business intelligence applications, where it enables fast querying of large datasets while reducing storage and processing costs.
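To make the idea of columnar storage concrete, here is a toy sketch in plain Python. It is only an illustration of the layout, not how Parquet actually encodes data on disk. The sample records and the `to_columns` helper are hypothetical.

```python
# Row-oriented layout: each record keeps all of its fields together.
rows = [
    {"user_id": 1, "country": "DE", "amount": 12.5},
    {"user_id": 2, "country": "US", "amount": 7.0},
    {"user_id": 3, "country": "DE", "amount": 3.25},
]


def to_columns(records):
    """Pivot a list of records into a dict of column name -> list of values."""
    columns = {}
    for record in records:
        for name, value in record.items():
            columns.setdefault(name, []).append(value)
    return columns


# Column-oriented layout: all values of one field sit next to each other.
columns = to_columns(rows)

# An aggregate over one field only needs to touch that single column.
total_amount = sum(columns["amount"])
print(total_amount)
```

In a real columnar file, this grouping is also what lets similar values be compressed together, since each column holds values of a single type.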

Settings

Start Date

The date from which to start retrieving data.

Filepath

The location of the Parquet file to connect to.
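As a rough sketch, the two settings above could be assembled into a JSON config of the kind Singer taps conventionally accept. The key names `filepath` and `start_date`, the example path, and the `build_tap_parquet_config` helper are assumptions made for illustration; check the connector's own documentation for the exact setting names.

```python
import json
from datetime import datetime


def build_tap_parquet_config(filepath: str, start_date: str) -> dict:
    """Return a config dict for the two settings (key names are assumed)."""
    if not filepath:
        raise ValueError("filepath must not be empty")
    # Raises ValueError if start_date is not a valid ISO 8601 date or datetime.
    datetime.fromisoformat(start_date)
    return {"filepath": filepath, "start_date": start_date}


# Hypothetical values: a local Parquet file and the date to start reading from.
config = build_tap_parquet_config("data/events.parquet", "2023-01-01")
config_json = json.dumps(config, indent=2)
print(config_json)
```

Validating the date up front means a malformed Start Date fails early, before any extraction is attempted.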

Meltano Community Connector

The Parquet connector is available on Meltano Community. It is built by our growing community of 5,000+ developers. Refer to the Install section below to verify the readiness of this connector.

Why Meltano?

Access to Meltano Slack community

Join 5,500+ data engineers and analytics practitioners. The community is active, helpful, and always on. Good for quick questions, sharing patterns, and learning what others are building.