Skip to content

fread fails to parse quotation mark as first character of field  #1256

@RuinedSubmartingale

Description

@RuinedSubmartingale

Consider the following files:
file1.csv

x,y
x1,"oops" y1

file2.csv

x,y
x1, "oops" y1

file3.csv

x,y
"oops" x1,y1

file4.csv

x,y
 "oops" x1,y1

Running fread('file2.csv') and fread('file4.csv') run smoothly. But fread('file1.csv') throws the following error:

Field 2 on line 2 starts with quote (") but then has a problem. It can contain balanced unescaped quoted subregions but if it does it can't contain embedded \n as well. Check for unbalanced unescaped quotes: "oops" y1

And fread('file3.csv') throws a similar error:
Error in fread("testfile.csv") :

Field 1 on line 2 starts with quote (") but then has a problem. It can contain balanced unescaped quoted subregions but if it does it can't contain embedded \n as well. Check for unbalanced unescaped quotes: "oops" x1,y1

The only difference between the two pairs of files is that file1 and file3 have fields that begin with a " character, whereas there's a space before the " in the fields for file2 and file4. Recent changes to fread seem to have caused this error in v1.9.5, seeing as how v1.9.4 is able to parse all 4 files smoothly.

Metadata

Metadata

Labels

Type

No type

Projects

No projects

Milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions