Data sets often contain comments that explain the data format or contain the license and usage terms. You usually want to ignore these lines when you read in the DataFrame.
The readtable function assumes that comment lines begin with the '#' character. However, your file may use comment marks like % or //. To make sure that readtable handles these correctly, you can specify the comment mark as a keyword argument:
readtable("dataset.csv", allowcomments=true, commentmark='%')