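I'm currently working on a project, but I got stuck at the very first step: reading in the data. The data has four variables: "tag", "book ID", "book title", and "author". They are separated by tabs. Here is a quick look: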
AMERICAN HISTORY b15857527 These United States Unger, Irwin
AMERICAN HISTORY b10957081 Cengage Advantage Books: American Passages Ayers, Edward L.; Gould, Lewis L.; Oshinsky, David M.; Soderlund, Jean R.
AMERICAN HISTORY b15131495 Voices of a People's History of the United States Zinn, Howard; Arnove, Anthony
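Now, here is the R code I use to read it:
train1 <- read.table("train1.txt", sep = "\t")
Then I got this error message:
Error in scan(file, what, nmax, sep, dec, quote, skip, nlines, na.strings, : line 7 did not have 4 elements
I used readLines() to check whether line 7 really lacks 4 elements, but it looks fine:
cat(readLines("train1.txt")[1:8], sep = "\n")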
AMERICAN HISTORY b15857527 These United States Unger, Irwin
AMERICAN HISTORY b10957081 Cengage Advantage Books: American Passages Ayers, Edward L.; Gould, Lewis L.; Oshinsky, David M.; Soderlund, Jean R.
AMERICAN HISTORY b15131495 Voices of a People's History of the United States Zinn, Howard; Arnove, Anthony
AMERICAN HISTORY b15683513 American Realities Youngs, J. William T.
AMERICAN HISTORY b9418230 American History: A Survey, Volume 1 Brinkley, Alan
AMERICAN HISTORY b14348885 Liberty, Equality, Power Murrin, John M.; Johnson, Paul E.; McPherson, James M.; Gerstle, Gary; Fahs, Alice
AMERICAN HISTORY b9372860 American History: A Survey, Volume 2 Brinkley, Alan
AMERICAN HISTORY b9489206 Religion in America Hemeyer, Julia Corbett
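I tried adjusting the original txt file by hand, but whatever I do, the same error always comes up at another line that looks perfectly fine. Thanks in advance for any help!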
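The problem is the apostrophe in line 3 ("People's"). By default, read.table() treats both " and ' as quote characters, so it reads the apostrophe as the start of a quoted element in line 3, column 3, and keeps reading, across tabs and line breaks, until it finds a closing apostrophe. That is why the error is reported at a line that looks fine on its own. Tell read.table() explicitly what the quote character should be:
train1 <- read.table("train1.txt", sep = "\t", quote = "\"")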
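To see the fix work without the original file, here is a minimal sketch that writes three of the rows from the question to a temporary file and reads them back. The temporary file and the column names (`tag`, `book_id`, `title`, `author`) are assumptions made for illustration; they are not part of the original data. It also shows `quote = ""`, which switches quote handling off entirely and may be the safer choice if the file could contain stray double quotes as well.

```r
# Hypothetical sample modelled on the rows shown in the question;
# the fields are joined with real tab characters.
sample_lines <- c(
  "AMERICAN HISTORY\tb15857527\tThese United States\tUnger, Irwin",
  "AMERICAN HISTORY\tb15131495\tVoices of a People's History of the United States\tZinn, Howard; Arnove, Anthony",
  "AMERICAN HISTORY\tb15683513\tAmerican Realities\tYoungs, J. William T."
)
tmp <- tempfile(fileext = ".txt")
writeLines(sample_lines, tmp)

# Column names are illustrative assumptions, not taken from the real file.
cols <- c("tag", "book_id", "title", "author")

# Option 1: only the double quote acts as a quote character, so the
# apostrophe in "People's" is read as ordinary text.
train1 <- read.table(tmp, sep = "\t", quote = "\"",
                     col.names = cols, stringsAsFactors = FALSE)

# Option 2: disable quote handling altogether.
train1_noquote <- read.table(tmp, sep = "\t", quote = "",
                             col.names = cols, stringsAsFactors = FALSE)

# Sanity check: with quoting disabled, every line should have 4 fields.
field_counts <- count.fields(tmp, sep = "\t", quote = "")

print(dim(train1))
print(train1$title[2])
print(field_counts)

unlink(tmp)
```

On the real file, running count.fields() with the same `sep` and `quote` settings you plan to pass to read.table() is a quick way to find any rows that still have an unexpected number of fields.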