我的系统是windowns 10,R 3.5.1,Rstudio 1.1.463
Rstudio 默认文本编码为 UTF-8
。
library(RMySQL)
library(haven)#读取spss
Rstudio 中上述脚本的输出为:
> library(RMySQL)
> library(haven)#
Warning message:
In readLines(file, warn = FALSE) :
invalid input found on input connection '~/.active-rstudio-document'
在 R 中运行上面的脚本时一切正常。
Rstudio似乎无法在评论中阅读中文,然后导致错误。
然后我用谷歌搜索,试图弄清楚这是否locale
问题:
Sys.getlocale()
Sys.setlocale(category = "LC_ALL", locale = "en_US.UTF-8")
#library(RMySQL)
#library(haven)#读取spss
获取输出如下:
> source('~/.active-rstudio-document', encoding = 'UTF-8', echo=TRUE)
> Sys.getlocale()
[1] "LC_COLLATE=English_United States.1252;LC_CTYPE=English_United States.1252;LC_MONETARY=English_United States.1252;LC_NUMERIC=C;LC_TIME=English_United States.1252"
> Sys.setlocale(category = "LC_ALL", locale = "en_US.UTF-8")
[1] ""
> #library(RMySQL)
> #library(haven)#
Warning messages:
1: In readLines(file, warn = FALSE) :
invalid input found on input connection '~/.active-rstudio-document'
2: In Sys.setlocale(category = "LC_ALL", locale = "en_US.UTF-8") :
OS reports request to set locale to "en_US.UTF-8" cannot be honored
根据输出,我发现了 2 个问题:
1.即使#library(haven)#读取spss
,仍然会导致Rstudio出错。
2.我无法更改语言环境English_United States.1252
如何解决这个Rstudio问题?
这是一个解决方案。花了我很长时间将Sys.setlocale + SET NAMES gbk解决方案放在一起,这就是最终工作的原因。
library(RMySQL)
Sys.setlocale(category = "LC_ALL", locale = "chinese")
sqlQuery <- function (query) {
# creating DB connection object with RMysql package
DB <- dbConnect(MySQL(),
user='',
password='',
client.flag=CLIENT_MULTI_STATEMENTS,
dbname='',
host='',
port=)
# close db connection after function call exits
on.exit(dbDisconnect(DB))
dbSendQuery(DB,"SET NAMES gbk")
# send Query to btain result set
rs <- dbSendQuery(DB, query)
# get elements from result sets and convert to dataframe
result <- fetch(rs, -1)
# return the dataframe
return(result)
}
data <- sqlQuery("SELECT * FROM TABLENAME")