hive 外部表迁移加载HDFS数据 hive加载数据到外部表

转载

网猴儿 2024-04-02 07:10:57

文章标签 hive 外部表迁移加载HDFS数据 hive 加载导出查询 文章分类 Hive 大数据

Hive表的数据加载

加载本地文件到数据表

$ local data local inpath '/../../.' into table table_name;

hive 外部表迁移加载HDFS数据 hive加载数据到外部表_查询

加载hdfs文件到hive表

$ load data inpath '/load_students' into student_load_hdfs;

hive 外部表迁移加载HDFS数据 hive加载数据到外部表_加载_02

覆盖表中所有数据

overwrite 关键字

$ local data local inpath '/../../.'overwrite  into table table_name ;

hive 外部表迁移加载HDFS数据 hive加载数据到外部表_导出_03

创建表时通过select加载

create table load_create_select as select stuid,stuname from student_load_hdfs;

hive 外部表迁移加载HDFS数据 hive加载数据到外部表_hive_04

创建表时通过insert加载

$ create table load_create_insert like student_load_hdfs;  
insert into table load_create_insert select * from student_load_hdfs

创建表时，指定location加载

dfs -mkdir -p /user/hive/warehouse/db_hive.db/load_location_put

hive 外部表迁移加载HDFS数据 hive加载数据到外部表_hive 外部表迁移加载HDFS数据_05

dfs -put /opt/modules/datafiles/load_students   /user/hive/warehouse/db_hive.db/load_location_put/;

hive 外部表迁移加载HDFS数据 hive加载数据到外部表_hive_06

create table  if not exists load_location_put(
stuid   int ,
stuname string ,
stuage  int 
)
row format delimited fields terminated by '\t'
stored as textfile

hive 外部表迁移加载HDFS数据 hive加载数据到外部表_hive_07

$ location '/user/hive/warehouse/db_hive.db/load_location_put';

hive 外部表迁移加载HDFS数据 hive加载数据到外部表_加载_08

导出数据(对hive表数据分析后保存)

保存到本地目录文件中

insert overwrite local directory '/opt/modules/datafiles/'  select * from load_location_put

hive 外部表迁移加载HDFS数据 hive加载数据到外部表_hive_09

保存到hdfs目录

insert overwrite directory '/hive_imp/load_localtion_put' select * from load_localtion_put;

hive 外部表迁移加载HDFS数据 hive加载数据到外部表_导出_10

重定向结果

在hive外部执行

bin/hive "select * from 
db_hive.load_location_put;">/opt/modules/datafiles/load_location_put.txt

hive 外部表迁移加载HDFS数据 hive加载数据到外部表_hive 外部表迁移加载HDFS数据_11

外部分区表

建表

create external table  if not exists student_external_part(
stuid   int ,
stuname string ,
stuage  int 
)
partitioned  by (class string) 
row format delimited fields terminated by '\t'
stored as textfile
location '/user/student_external_part'
;

加载数据

load data loacal inpath '/,./../.' into table  table_name partition(class='ss');

分区表与常规表加载数据的区别

创建常规表，加载数据

hive 外部表迁移加载HDFS数据 hive加载数据到外部表_加载_12

创建常规表与分区表对比

hive 外部表迁移加载HDFS数据 hive加载数据到外部表_加载_13

加载数据前，外部数据表无数据

hive 外部表迁移加载HDFS数据 hive加载数据到外部表_hive_14

加载数据前外部分区表的分区

hive 外部表迁移加载HDFS数据 hive加载数据到外部表_加载_15

加载数据到分区

alter table student_part add partition(class='01');

hive 外部表迁移加载HDFS数据 hive加载数据到外部表_加载_16

查看分区表信息

hive 外部表迁移加载HDFS数据 hive加载数据到外部表_加载_17

通过UI查看信息

hive 外部表迁移加载HDFS数据 hive加载数据到外部表_hive 外部表迁移加载HDFS数据_18

常规表上传数据到hdfs的表路径中，然后创建表，数据会加载到表中，而分区表则不会，还需要使用alter来加载分区

export，import 数据

export(从hive导出到hdfs) 数据

export  table student_load_hdfs to '/hive_inp/export_tables'

hive 外部表迁移加载HDFS数据 hive加载数据到外部表_导出_19

通过UI查看

hive 外部表迁移加载HDFS数据 hive加载数据到外部表_加载_20

import (从hdfs导入到hive表)数据

import table student_load_hdfs2  from '/hive_inp/export_tables';

hive 外部表迁移加载HDFS数据 hive加载数据到外部表_查询_21

sort by，distribute by ，cluster by ，order by 的作用

order by

全局排序，只作用于一个reduce

sort by

对每一个Reduce内部的数据进行排序
>set mapreduce.job.reduces=3;
>select * from emp sort by empno desc;  
>insert overwrite local directory '/opt/datafiles/sort-by/ ' select * from emp sort by empno desc;

hive 外部表迁移加载HDFS数据 hive加载数据到外部表_查询_22

结果截图

hive 外部表迁移加载HDFS数据 hive加载数据到外部表_加载_23

distribute by

作用类似于分区partitioner ，底层是mapreduce，通常与sort by进行使用，在sort by之前使用

set setmapre.job.reduces=3  
insert overwrite local directory '/opt/datafiles/sort-by/ ' select * from emp  distribute by deptno sort by empno desc;

cluster by

当sort by，distribute by 字段相同时，，用clustor代替

insert overwrite local directory '/opt/datafiles/sort-by/ ' select * from emp  cluster by empno desc;

hive 外部表迁移加载HDFS数据 hive加载数据到外部表_查询_24

输出结果

hive 外部表迁移加载HDFS数据 hive加载数据到外部表_hive_25

我这里因为插入数据间的类型不为制表符的原因导致输出的类型有问题

本文章为转载内容，我们尊重原作者对文章享有的著作权。如有内容错误或侵权问题，欢迎原作者联系我们进行内容更正或删除文章。

上一篇：WPF Grid的宽度跟随父element wpf gridview样式

下一篇：linux core_uses_pid参数 linux /proc/pid

提问和评论都可以，用心的回复会被更多人看到评论

发布评论

相关文章

官方博客	全部文章	热门标签	班级博客
了解我们	网站地图	意见反馈

鸿蒙开发者社区	51CTO学堂
51CTO	软考资讯