- Machine Learning With Go
- Daniel Whitenack
- 350字
- 2021-07-08 10:37:26
Reading in CSV data from a file
Let's consider a simple CSV file, which we will return to later, named iris.csv (available here: https://archive.ics.uci.edu/ml/datasets/iris). This CSV file includes four float columns of flower measurements and a string column with the corresponding flower species:
$ head iris.csv
5.1,3.5,1.4,0.2,Iris-setosa
4.9,3.0,1.4,0.2,Iris-setosa
4.7,3.2,1.3,0.2,Iris-setosa
4.6,3.1,1.5,0.2,Iris-setosa
5.0,3.6,1.4,0.2,Iris-setosa
5.4,3.9,1.7,0.4,Iris-setosa
4.6,3.4,1.4,0.3,Iris-setosa
5.0,3.4,1.5,0.2,Iris-setosa
4.4,2.9,1.4,0.2,Iris-setosa
4.9,3.1,1.5,0.1,Iris-setosa
With encoding/csv imported, we first open the CSV file and create a CSV reader value:
// Open the iris dataset file.
f, err := os.Open("../data/iris.csv")
if err != nil {
log.Fatal(err)
}
defer f.Close()
// Create a new CSV reader reading from the opened file.
reader := csv.NewReader(f)
Then we can read in all of the records (corresponding to rows) of the CSV file. These records are imported as [][]string:
// Assume we don't know the number of fields per line. By setting
// FieldsPerRecord negative, each row may have a variable
// number of fields.
reader.FieldsPerRecord = -1
// Read in all of the CSV records.
rawCSVData, err := reader.ReadAll()
if err != nil {
log.Fatal(err)
}
We can also read in records one at a time in an infinite loop. Just make sure that you check for the end of the file (io.EOF) so that the loop ends after reading in all of your data:
// Create a new CSV reader reading from the opened file.
reader := csv.NewReader(f)
reader.FieldsPerRecord = -1
// rawCSVData will hold our successfully parsed rows.
var rawCSVData [][]string
// Read in the records one by one.
for {
// Read in a row. Check if we are at the end of the file.
record, err := reader.Read()
if err == io.EOF {
break
}
// Append the record to our dataset.
rawCSVData = append(rawCSVData, record)
}
If your CSV file is not delimited by commas and/or if your CSV file contains commented rows, you can utilize the csv.Reader.Comma and csv.Reader.Comment fields to properly handle uniquely formatted CSV files. In cases where the fields in your CSV file are single-quoted, you may need to add in a helper function to trim the single quotes and parse the values.
推薦閱讀
- Learning Scala Programming
- Bootstrap Site Blueprints Volume II
- 程序員面試白皮書
- BeagleBone Media Center
- INSTANT CakePHP Starter
- Learn Scala Programming
- SAP BusinessObjects Dashboards 4.1 Cookbook
- VMware虛擬化技術
- Terraform:多云、混合云環境下實現基礎設施即代碼(第2版)
- Illustrator CS6設計與應用任務教程
- Instant Zurb Foundation 4
- Python Deep Learning
- 軟件測試分析與實踐
- JavaScript前端開發基礎教程
- Keil Cx51 V7.0單片機高級語言編程與μVision2應用實踐