
| Spark SQL DataType | Scala value type |
| --- | --- |
| ShortType | Short |
| IntegerType | Int |
| LongType | Long |
| FloatType | Float |
| DoubleType | Double |
| DecimalType | java.math.BigDecimal |
| StringType | String |
| BinaryType | Array[Byte] |
| BooleanType | Boolean |
| TimestampType | java.sql.Timestamp |
| DateType | java.sql.Date |
| ArrayType | scala.collection.Seq |
| MapType | scala.collection.Map |
| StructType | org.apache.spark.sql.Row |
| StructField | The Scala value type of the field's data type (for example, Int for a StructField with data type IntegerType) |
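The mappings in the table can be exercised with plain Scala values, no Spark required; a minimal sketch of a few rows:

```scala
// Plain-Scala values of several types from the mapping table above
val s: Short = 1                                               // ShortType
val i: Int = 42                                                // IntegerType
val dec = new java.math.BigDecimal("3.14")                     // DecimalType
val ts = java.sql.Timestamp.valueOf("2024-01-01 00:00:00")     // TimestampType
val arr: Seq[Int] = Seq(1, 2, 3)                               // ArrayType
val m: Map[String, Int] = Map("a" -> 1)                        // MapType
println(arr.sum)  // 6
```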

A Spark SQL data type conversion example

In a nutshell: call the cast method on the Column class.

How to obtain a Column

This was covered previously; the common ways are:

df("columnName")            // On a specific `df` DataFrame.
col("columnName")           // A generic column not yet associated with a DataFrame.
col("columnName.field")     // Extracting a struct field
col("`a.column.with.dots`") // Escape `.` in column names.
$"columnName"               // Scala short hand for a named column.
Test data preparation (./data/user):
1,tom,23
2,jack,24
3,lily,18
4,lucy,19
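Before Spark enters the picture, the per-line parsing that the snippets below perform can be sketched in plain Scala (the sample lines mirror ./data/user):

```scala
// Split each CSV line on "," and keep the three fields as a tuple of strings
val lines = Seq("1,tom,23", "2,jack,24", "3,lily,18", "4,lucy,19")
val rows = lines.map(_.split(",")).map(x => (x(0), x(1), x(2)))
println(rows.head)  // (1,tom,23)
println(rows.size)  // 4
```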
Spark entry-point code
import org.apache.spark.sql.SparkSession

val spark = SparkSession
  .builder()
  .appName("test")
  .master("local[*]")
  .getOrCreate()
Checking the default data types
import spark.implicits._

spark.read
  .textFile("./data/user")
  .map(_.split(","))
  .map(x => (x(0), x(1), x(2)))
  .toDF("id", "name", "age")
  .dtypes
  .foreach(println)

Result:

(id,StringType)
(name,StringType)
(age,StringType)

This shows that every column defaults to StringType when the data is read as text.

Casting the numeric columns to IntegerType
import spark.implicits._

spark.read
  .textFile("./data/user")
  .map(_.split(","))
  .map(x => (x(0), x(1), x(2)))
  .toDF("id", "name", "age")
  .select($"id".cast("int"), $"name", $"age".cast("int"))
  .dtypes
  .foreach(println)

Result:

(id,IntegerType)
(name,StringType)
(age,IntegerType)
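Note that cast("int") yields null for any value that cannot be parsed as an integer. That per-value behavior can be mimicked in plain Scala with scala.util.Try (castToInt here is a hypothetical helper for illustration, not a Spark API):

```scala
import scala.util.Try

// Mimics the per-value semantics of Spark's cast("int"):
// a parseable number becomes an Int, anything else becomes None (Spark: null)
def castToInt(s: String): Option[Int] = Try(s.trim.toInt).toOption

println(castToInt("23"))   // Some(23)
println(castToInt("tom"))  // None
```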
The two overloads of the Column class's cast method
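The shape of the two overloads, cast(to: DataType) and cast(to: String), can be sketched with a toy class (FakeColumn and the DataType trait below are illustrative stand-ins, not Spark's implementation):

```scala
sealed trait DataType
case object IntegerType extends DataType

// Toy stand-in showing how the two cast overloads differ only in parameter type
class FakeColumn(name: String) {
  def cast(to: DataType): String = s"CAST($name AS $to)"              // DataType overload
  def cast(to: String): String = s"CAST($name AS ${to.toUpperCase})"  // String overload
}

val c = new FakeColumn("age")
println(c.cast(IntegerType))  // CAST(age AS IntegerType)
println(c.cast("int"))        // CAST(age AS INT)
```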