Skip to content

How to read the column by specific data instead of the entire parquet file? #4247

Closed Answered by tustvold
jun0315 asked this question in Q&A
Discussion options

You must be logged in to vote

The readers automatically perform IO pushdown, they will only fetch the byte ranges needed, this includes column projection, and extends through to row group and page pruning, late materialization, etc...

The readers aim to be batteries included, you shouldn't need to worry about pages, column chunks, etc... it will just do the right thing

Replies: 1 comment 2 replies

Comment options

You must be logged in to vote
2 replies
@jun0315
Comment options

@tustvold
Comment options

Answer selected by Jefffrey
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants