在Hive中,可以使用FROM 'file_path' [OPTIONS]
语句来读取外部文件,并通过ROW FORMAT
和STORED AS
子句来指定数据的格式
CREATE EXTERNAL TABLE table_name (
column1 datatype,
column2 datatype,
...
)
ROW FORMAT DELIMITED
FIELDS TERMINATED BY ','
STORED AS TEXTFILE;
CREATE EXTERNAL TABLE table_name (
column1 datatype,
column2 datatype,
...
)
ROW FORMAT SERDE 'org.openx.data.jsonserde.JsonSerDe'
WITH SERDEPROPERTIES (
"serialization.format" = "1"
)
STORED AS TEXTFILE;
CREATE EXTERNAL TABLE table_name (
column1 datatype,
column2 datatype,
...
)
ROW FORMAT SERDE 'org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe'
WITH SERDEPROPERTIES (
"serialization.format" = "1"
)
STORED AS PARQUET;
CREATE EXTERNAL TABLE table_name (
column1 datatype,
column2 datatype,
...
)
ROW FORMAT SERDE 'org.apache.hadoop.hive.ql.io.orc.OrcSerde'
WITH SERDEPROPERTIES (
"serialization.format" = "1"
)
STORED AS ORC;
请将table_name
、column1
、column2
、datatype
等替换为实际的表名、列名和数据类型。同时,根据需要修改OPTIONS
和SERDEPROPERTIES
中的参数。
亿速云「云服务器」,即开即用、新一代英特尔至强铂金CPU、三副本存储NVMe SSD云盘,价格低至29元/月。点击查看>>
推荐阅读:hdfs hive如何进行数据格式化