issue with csv #622
Comments
Thanks @breck7 for this issue. Here is what I found.
throws the following error:
fails with the following error:
gives the following output (missing data issues in this case):
gives the following error
So far I have tried these; let me know if you need help with something. Thanks for looking into this.
Okay, the regression: at some point I added an optimized path for faster CSV generation ("buildConcepts") that only quoted values containing commas, and I forgot to also quote and escape values that had no commas but did contain quotes. Interestingly, all the spreadsheet programs still seemed to open the poorly quoted CSVs fine, but opening the CSVs programmatically didn't work. Thank you so much @swaptr for finding and reporting this!
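To illustrate the kind of bug described above, here is a minimal Python sketch of the usual CSV quoting rule (as in RFC 4180): a field is wrapped in double quotes if it contains a comma, a double quote, or a line break, and any embedded double quotes are doubled. This is not the project's actual code; `escape_csv_field` and `to_csv_line` are hypothetical names used only for illustration.

```python
import csv
import io


def escape_csv_field(value: str) -> str:
    """Quote a field if it contains a comma, a double quote, or a line break.

    Hypothetical helper for illustration; not taken from the project.
    """
    if any(ch in value for ch in (",", '"', "\n", "\r")):
        # Embedded double quotes are escaped by doubling them.
        return '"' + value.replace('"', '""') + '"'
    return value


def to_csv_line(fields):
    """Join already-stringified fields into one CSV record (no trailing newline)."""
    return ",".join(escape_csv_field(f) for f in fields)


# A value with quotes but no commas is the case the regression missed.
row = ["plain", "has,comma", 'has "quotes" only']
line = to_csv_line(row)

# Round-trip through the standard library parser to check the record.
parsed = next(csv.reader(io.StringIO(line)))
```

Checking for the quote character as well as the comma is what lets a strict parser such as Python's `csv` module read the row back with the same number of fields.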
This fix: breck7/scroll@28964ac#r152645225
I recently tried loading your CSV for a data analysis project and noticed an inconsistency in the formatting: some rows do not seem to have the expected number of fields. Have you used this CSV for data analysis yourself? If there is a trick to it, any insight you could share would be really appreciated.

When loading the CSV using pandas I run into the following error:

pandas.errors.ParserError: Error tokenizing data. C error: Expected 396 fields in line 5, saw 852

I see similar errors when loading the TSV file, and the JSON file seems to be missing some columns/fields.
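One way to narrow down a field-count mismatch like this is to parse the file with Python's standard `csv` module and list the records whose length differs from the header's. The sketch below is only an illustration: `field_count_mismatches` is a hypothetical helper and the sample text is made up, not taken from the actual file.

```python
import csv
import io


def field_count_mismatches(text: str):
    """Return (record_number, field_count) for records that differ from the header.

    Record numbers are 1-based and count parsed records, which can differ
    from physical line numbers when quoted fields span several lines.
    """
    reader = csv.reader(io.StringIO(text))
    counts = [(i, len(record)) for i, record in enumerate(reader, start=1)]
    if not counts:
        return []
    expected = counts[0][1]  # Treat the first record as the header.
    return [(i, n) for i, n in counts if n != expected]


# Made-up sample: the third record has one field too many.
sample = "id,name\n1,a\n2,b,extra\n"
bad = field_count_mismatches(sample)
```

For a file on disk, the text could be read with `open(path, newline="", encoding="utf-8").read()` before being passed to the helper.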