Data in csv files with less columns than schema shifts data. #16763
Labels
bug
Something isn't working
needs triage
Awaiting prioritization by a maintainer
python
Related to Python Polars
Checks
Reproducible example
Observed result:
Log output
file < 128 rows, no statistics determined no. of chunks: 1 processed by: 1 threads.
Issue description
If you pass a csv file that is missing one column from the supplied columns in the schema, the data is shifted. This used to already be the case for
read_csv
, now also forscan_csv
(since this fix: #16080, I believe). I would expect the reading and scanning function to add null's to columns that are present in the schema but not in the file.Expected behavior
Or the error:
ComputeError: found more fields than defined in 'Schema'
Installed versions
The text was updated successfully, but these errors were encountered: