What is a promise in Javascript?

Question

Ares777

Asked: 2020-05-24 05:26:37 +0800 CST 2020-05-24 05:26:37 +0800 CST 2020-05-24 05:26:37 +0800 CST

PySpark - 如何从 Spark DataFrame 的 JSON 格式但类型为 String 的列中提取信息

772

我发现自己使用PySpark并使用Spark DataFrame ，其中 DataFrame 的每一行都包含此信息（将始终相同），尽管“树”、“草”和“杂草”中的值可能会有所不同“ .

{tree={in_season=true, index={color=null, category=null, value=null}, display_name=Tree, data_available=false}, weed={in_season=false, index={color=null, category=null, value=null}, display_name=Weed, data_available=false}, grass={in_season=true, index={color=null, category=null, value=null}, display_name=Grass, data_available=false}}

我想要做的是保留一些字段，例如，从“树”中保留字段“in_season”、“index -> value”、“display_name”等。

数据框具有以下架构：

df2.printSchema()

数据：地图（可为空=真）
- 键：字符串
- 值：字符串（valueContainsNull = true）
类型：字符串（可为空=真）
植物：字符串（可为空=真）

到目前为止，我尝试的是使用 StructType() 如下：

schema = ArrayType(
    StructType([StructField("tree", StringType())]))

df3 = df2.withColumn("tree", from_json(df2.types, schema))

对于数据帧的每一行，我得到的结果都是 NULL。

有没有其他方法可以做到这一点，或者我必须以另一种方式使用 StructType 吗？

非常感谢您的帮助！

1 Answers

Voted

Jino Michel Aque · Answer 1 · 2020-05-25T07:34:22+08:00

Best Answer

Jino Michel Aque

2020-05-25T07:34:22+08:002020-05-25T07:34:22+08:00

对于您的问题，使用explode 可能很有用。链接到处理它的文章：PySpark explode array and map columns to rows

0

PySpark - 如何从 Spark DataFrame 的 JSON 格式但类型为 String 的列中提取信息

HTML button that sends you to another page

Why do I get the error "Call to undefined function mysql_connect()"?

How to create an HTML button that works as a link?

How to separate a String in Java. How to use split()

Filter by dates in sql server

How to limit the number of decimal places in a double?

For each in JavaScript?

Position footer ALWAYS glued to the footer

Definitive Guide to Type Conversion in Java

How to properly compare Strings (and objects) in Java?