Get CSV file header using Apache Commons

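I have spent the last two hours searching in vain for a solution. I am trying to read a CSV file with Apache Commons CSV. I can read the whole file, but how do I extract only the CSV header into an array?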

I looked everywhere, and even the solution above did not work for me. For anyone else who runs into this problem, the following does:

Iterable<CSVRecord> records;
Reader in = new FileReader(fileLocation);
records = CSVFormat.EXCEL.withHeader().withSkipHeaderRecord(false).parse(in);
Set<String> headers = records.iterator().next().toMap().keySet();

Note that calling .next() here already consumes one row of the CSV.
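To illustrate that point, here is a minimal sketch that uses a plain java.util list in place of the parsed records, so it runs without the library. The class and method names are made up for the example. One difference worth keeping in mind: a List hands out a fresh iterator on every iterator() call, whereas CSVParser reads from the underlying Reader, so a record consumed there is not replayed.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.Iterator;
import java.util.List;

// Hypothetical demo class; strings stand in for CSVRecord objects.
class IteratorConsumptionDemo {

    // Calls next() once (as the snippet above does to reach the header map)
    // and then collects whatever the same iterator still has to offer.
    static List<String> remainingAfterNext(Iterable<String> rows) {
        Iterator<String> it = rows.iterator();
        it.next(); // this row is now consumed
        List<String> rest = new ArrayList<>();
        while (it.hasNext()) {
            rest.add(it.next());
        }
        return rest;
    }

    public static void main(String[] args) {
        List<String> rows = Arrays.asList("row1", "row2", "row3");
        System.out.println(remainingAfterNext(rows)); // prints [row2, row3]
    }
}
```

If the first data row is still needed afterwards, keep the value returned by next() in a variable and process it along with the remaining rows.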

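By default, the first record returned by CSVParser is the header row, unless the format is told to read the header from the file or to skip it. For example: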

CSVFormat csvFileFormat = CSVFormat.DEFAULT.withHeader(FILE_HEADER_MAPPING);
FileReader fileReader = new FileReader("file");
CSVParser csvFileParser = new CSVParser(fileReader, csvFileFormat);
List<CSVRecord> csvRecords = csvFileParser.getRecords();

Here csvRecords.get(0) returns the header record.

BufferedReader br = new BufferedReader(new FileReader(filename));
CSVParser parser = CSVParser.parse(br, CSVFormat.EXCEL.withFirstRecordAsHeader());
List<String> headers = parser.getHeaderNames();

This worked for me. The last line is the one you need: it extracts the headers the parser found into a list of strings.

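As of Apache Commons CSV v1.9.0, withSkipHeaderRecord() and the related with...() methods (withFirstRecordAsHeader(), for example) are deprecated in favor of a builder interface. Use it like this: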

CSVFormat.DEFAULT.builder()
    .setHeader()
    .setSkipHeaderRecord(true)
    .build();

In Kotlin:

val reader = File(path).bufferedReader()
val records = CSVFormat.DEFAULT.withFirstRecordAsHeader()
    .withIgnoreHeaderCase()
    .withTrim()
    .parse(reader)
println(records.headerNames)

The code below works for me:

import java.io.FileReader;
import java.io.IOException;
import java.util.List;
import org.apache.commons.csv.*;

public static String[] headersInCSVFile(String csvFilePath) throws IOException {
    // Read the file; try-with-resources closes the reader and the parser
    try (FileReader fileReader = new FileReader(csvFilePath);
         CSVParser csvFileParser = new CSVParser(fileReader, CSVFormat.DEFAULT)) {
        List<CSVRecord> csvRecords = csvFileParser.getRecords();

        // No header is configured, so the first record is the header row.
        // Read its values directly instead of splitting toString(), whose
        // format is not stable across library versions.
        CSVRecord headerRecord = csvRecords.get(0);
        String[] result = new String[headerRecord.size()];
        for (int i = 0; i < headerRecord.size(); i++) {
            // replaceAll("\\s", "") removes whitespace
            result[i] = headerRecord.get(i).replaceAll("\\s", "");
        }
        return result;
    }
}
