
How to read a parquet file directly without registering a TempTable in SparkSQL

Updated: 2023-08-19
Original (English): apache spark sql - How to read parquet file directly without registering TempTable in SparkSQL

When I run SQL against parquet files, I always call sqlContext.read.parquet() => df.registerTempTable() => sqlContext.sql(), like this:

val df = sqlContext.read.parquet("path/to/2016.05.30/")
df.registerTempTable("tab")
sqlContext.sql("SELECT * FROM tab")

The Spark documentation says:

Instead of using read API to load a file into DataFrame and query it, you can also query that file directly with SQL.
val df = sqlContext.sql("SELECT * FROM parquet.`examples/src/main/resources/users.parquet`")

So I changed my code to:

val df = sqlContext.sql("SELECT * FROM parquet.`path/to/2016.05.30/`")

But I get an error:

org.apache.spark.sql.AnalysisException: no such table parquet.path/to/2016.05.30/;

How can I query the file directly?

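Querying files directly with SQL is supported starting with Spark 1.6. Check which Spark version you are running. On an older version, parquet.`...` is most likely being resolved as an ordinary table name, which is consistent with the "no such table" error above.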
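
One way to act on this is to check the running version before choosing between the two query styles. The following is only a minimal sketch: `DirectQueryCheck`, `majorMinor`, and `supportsDirectFileQuery` are hypothetical helper names, not part of Spark, and the commented usage assumes a spark-shell session where `sc` and `sqlContext` are already defined.

```scala
// Sketch only: these helpers are hypothetical and not part of the Spark API.
object DirectQueryCheck {

  /** Extracts (major, minor) from a version string such as "1.5.2" or "2.4.0-SNAPSHOT". */
  def majorMinor(version: String): Option[(Int, Int)] = {
    val VersionPattern = """^(\d+)\.(\d+).*""".r
    version match {
      case VersionPattern(major, minor) => Some((major.toInt, minor.toInt))
      case _                            => None // unrecognized format
    }
  }

  /** True for 1.6 and later, the versions that can run SQL on files directly. */
  def supportsDirectFileQuery(version: String): Boolean =
    majorMinor(version).exists { case (major, minor) =>
      major > 1 || (major == 1 && minor >= 6)
    }
}

// Usage inside spark-shell (assumes `sc` and `sqlContext` exist there):
//
// val df =
//   if (DirectQueryCheck.supportsDirectFileQuery(sc.version)) {
//     sqlContext.sql("SELECT * FROM parquet.`path/to/2016.05.30/`")
//   } else {
//     // Older versions: fall back to the temp-table approach from the question.
//     sqlContext.read.parquet("path/to/2016.05.30/").registerTempTable("tab")
//     sqlContext.sql("SELECT * FROM tab")
//   }
```

If the version string cannot be parsed, the helper returns false, so the code falls back to the temp-table approach, which works on both older and newer versions.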
