# Execution Data Profile

## Execution Data Profile

### Description

An Apache Hop Execution Data Profile builds data profiles as data flow through pipelines. A number of data profilers can be selected and configure the fine tune the type and detail of the data that is profiled.

### Options

| Option               | Description                                                                           |
| -------------------- | ------------------------------------------------------------------------------------- |
| Name                 | The name to be used for this Execution Data Profile                                   |
| Description          | A description to be used for this Execution Data Profile                              |
| Data Samplers to use | One or more data samplers to use with this Execution Data Profile. See details below. |

### Data Samplers

| Data Sampler             | Description                                                                  | Options                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                  |
| ------------------------ | ---------------------------------------------------------------------------- | -------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
| Data profile output rows | Allow for some basic data profiling to be performed on transform output rows | <ul><li>Sample size: This is the maximum number of sample rows kept for any discovered profiling result (default: 25)</li><li>Last transforms only: only perform data profiling on pipeline endpoints (last transforms)? (default: true)</li><li>Minima: store the minimum value for this data profile (default: true)</li><li>Maxima: store the maximum value for this data profile (default: true)</li><li>Count nulls: count null values for this data profile (default: true)</li><li>Count non-nulls: count non-null values for this data profile (default: true)</li><li>Min length: store the minimum lengths for this data profile (default: true)</li><li>Max length: store the maximum lengths for this data profile (default: true)</li></ul> |
| First output rows        | Samples the first rows of a transform output                                 | Sample size (default: 100)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
| Last output rows         | Samples the last rows of a transform output                                  | Sample size (default: 100)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |
| Random output rows       | Do reservoir sampling on the output rows of a transform                      | Sample size (default: 100)                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                                               |


---

# Agent Instructions: Querying This Documentation

If you need additional information that is not directly available in this page, you can query the documentation dynamically by asking a question.

Perform an HTTP GET request on the current page URL with the `ask` query parameter:

```
GET https://docs.primeur.com/data-shaper-1.21/knowing-the-data-shaper-designer/index-2/execution-data-profile.md?ask=<question>
```

The question should be specific, self-contained, and written in natural language.
The response will contain a direct answer to the question and relevant excerpts and sources from the documentation.

Use this mechanism when the answer is not explicitly present in the current page, you need clarification or additional context, or you want to retrieve related documentation sections.
