# Parquet File Input

### Description <a href="#description" id="description"></a>

The Parquet File Input transform reads (primitive) values from an Apache Parquet file.

For more information, see [Apache Parquet](http://parquet.apache.org/).

| Hop Engine | <sup>✓</sup> |
| ---------- | ------------ |
| Spark      | <sup>✓</sup> |
| Flink      | <sup>✓</sup> |
| Dataflow   | <sup>✓</sup> |

### Options

Notes:

* To support reading from any location through Apache VFS, each file is loaded into memory (one at a time). Make sure to allocate enough memory for this.
* Long values can be deserialized to Dates if they are epoch timestamps: milliseconds since `1970-01-01 00:00:00.000`.
* Parquet Binary fields are treated as Hop Strings, but you can also read them as Hop Binary.
* All input values are passed to the output.
* INT96 is converted to the Hop Binary data type.
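
The two timestamp notes above can be illustrated with a minimal Python sketch. It is not the transform's own implementation. The function names are hypothetical, the epoch is assumed to be UTC, and the INT96 layout (8 bytes of nanoseconds within the day followed by a 4-byte Julian day number, both little-endian) is the legacy timestamp convention commonly written by Impala, Hive, and Spark. Whether a particular file follows that convention is an assumption you should verify for your data.

```python
import struct
from datetime import datetime, timedelta, timezone

# Assumption: the epoch is interpreted in UTC.
EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)
# Julian day number that corresponds to 1970-01-01.
JULIAN_DAY_OF_EPOCH = 2440588


def epoch_millis_to_datetime(millis: int) -> datetime:
    """Interpret a long as milliseconds since 1970-01-01 00:00:00.000."""
    return EPOCH + timedelta(milliseconds=millis)


def int96_to_datetime(raw: bytes) -> datetime:
    """Decode a 12-byte INT96 value, assuming the legacy timestamp layout.

    Layout assumed: int64 nanoseconds of day, then int32 Julian day,
    both little-endian. Sub-microsecond precision is truncated.
    """
    if len(raw) != 12:
        raise ValueError(f"expected 12 bytes, got {len(raw)}")
    nanos_of_day, julian_day = struct.unpack("<qi", raw)
    days_since_epoch = julian_day - JULIAN_DAY_OF_EPOCH
    return EPOCH + timedelta(days=days_since_epoch,
                             microseconds=nanos_of_day // 1000)


if __name__ == "__main__":
    print(epoch_millis_to_datetime(0))  # 1970-01-01 00:00:00+00:00
    # One hour into the day after the epoch, packed in the assumed layout.
    sample = struct.pack("<qi", 3_600_000_000_000, JULIAN_DAY_OF_EPOCH + 1)
    print(int96_to_datetime(sample))
```

A decoder like `int96_to_datetime` would only be needed downstream of this transform, since INT96 values arrive as raw Hop Binary rather than as Dates.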

| Option            | Description                                                                                                                      |
| ----------------- | -------------------------------------------------------------------------------------------------------------------------------- |
| Transform name    | Name of the transform. The name must be unique within a single pipeline.                                                         |
| Filename field    | Specify the input field. Use a transform like Get File Names to obtain file names. Any supported file location is fine.          |
| Fields            | The fields to read from the Parquet files, along with the desired Hop output type for each.                                      |
| Get fields button | Select a Parquet file; its schema is read to populate the Fields grid.                                                           |


