reading in data whose header line only contains col names #376

vc77 · 2025-08-08T10:28:48Z

vc77
Aug 8, 2025

Warm hello. Based on what i see in the documentation of how to read in 'pandas style' csv data, it seems unless one specifies a header line whose values are of the form :::, etc., the data cannot be parsed. One can use a 'schema' but in that case one must specify the length of columns with knowledge that is otherwise not convenient to obtain (and of course hard code in the column names and their sequence).
im looking for a way to read in a csv with a less rigid format requirement (but there doesnt seem to be one), then apply the get_columns_info() method to make up for the necessary info in conjunction with other utilities on offer like load_column(). ill then have all i need to initialize a DataFrame object.
Id very much appreciate a simple confirmation that my understanding is correct that unless one supplies a header line as csv2 expects, there's no way to circumvent (temporarily) the formatting requirement in conjunction with get_columns_info(). Thanks in advance.

hosseinmoein · 2025-08-08T12:17:15Z

hosseinmoein
Aug 8, 2025
Maintainer

It is true that if you read a csv2 (Pandas like) file you must either have a header in the file or use a schema. But the number of records in header or schema are for efficiently and only once allocate memory for the given column. In other words, the number of records doesn't need to be accurate. I could be zero or an approximation or accurate. Of course if it is not accurate you may allocate more memory than you need or allocate multiple times. But in any case it would work

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

reading in data whose header line only contains col names #376

Uh oh!

{{title}}

Uh oh!

Replies: 1 comment

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Uh oh!

reading in data whose header line only contains col names #376

Uh oh!

vc77 Aug 8, 2025

Replies: 1 comment

Uh oh!

hosseinmoein Aug 8, 2025 Maintainer

vc77
Aug 8, 2025

hosseinmoein
Aug 8, 2025
Maintainer