
Question

Asked: 2021-11-03 03:31:07 +0800 CST

How do I join DataFrames in Pandas?

If I have several different Pandas DataFrames, how can I join them?

For example, I have created three DataFrames:

import pandas as pd
import numpy as np

df_1 = pd.DataFrame({"fruta": ["manzana", "pera", "platano", "naranja", "aguacate"],
                    "precio": [0.20, 0.45, 0.15, 0.12, 0.62]})
df_2 = pd.DataFrame({"stock": [10, 20, 25, 12, 40]})
df_3 = pd.DataFrame({"ventas_totales":[3, 5, 2, 3, 6],
                     "ingresos_ventas": [120, 110, 64,44, 147]})

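Is there a way to concatenate (join) them in Pandas, or is that not possible and I should use a for loop?

If they have an identifier (ID), can they be joined on a column, as in SQL?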

1 Answer

Rubiales Alberto · Answer 1 · 2021-11-03T03:31:07+08:00

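Pandas has several ways to join DataFrames, and which one suits you best depends on what you want to do. Using the example from the question, I will explain the two main approaches and their results.

Concatenation

If you want to join different DataFrames and their rows are in the same order (that is, row 1 of DataFrame 1 corresponds to row 1 of DataFrame 2 and of DataFrame 3), you can do this: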

pd.concat([df_1, df_2, df_3], axis=1)

Output:

      fruta  precio  stock  ventas_totales  ingresos_ventas
0   manzana    0.20     10               3              120
1      pera    0.45     20               5              110
2   platano    0.15     25               2               64
3   naranja    0.12     12               3               44
4  aguacate    0.62     40               6              147

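With axis=1 we indicate that we want to join the DataFrames side by side, matching rows by index so that each one contributes its columns. With axis=0 they are stacked one below the other instead.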
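To make the difference between the two axis values concrete, here is a minimal sketch. The frames df_a and df_b are small hypothetical examples, not the ones from the question.

```python
import pandas as pd

# Hypothetical two-row frames, used only for illustration
df_a = pd.DataFrame({"fruta": ["manzana", "pera"], "precio": [0.20, 0.45]})
df_b = pd.DataFrame({"stock": [10, 20]})

# axis=1: place the frames side by side, matching rows by index
side_by_side = pd.concat([df_a, df_b], axis=1)
print(side_by_side.shape)  # (2, 3)

# axis=0: stack the frames vertically; columns a frame lacks become NaN
stacked = pd.concat([df_a, df_b], axis=0)
print(stacked.shape)  # (4, 3)
```

With axis=0 the original index labels are kept, so the stacked result has the labels 0, 1, 0, 1 unless you pass ignore_index=True.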

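Advantages:

Quick and simple
It accepts a list of DataFrames, so any number of them can be joined at once (very useful when the DataFrames have been collected in a list with .append())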

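Disadvantages:

The DataFrames should have the same number of rows; where they do not, the missing positions are filled with NaN
The rows must be in the same order, because concat joins them by index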
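The index-based matching behind these disadvantages can be seen in a small sketch. The frames frutas and stock below are hypothetical, and resetting the index is just one possible way to pair rows by position.

```python
import pandas as pd

frutas = pd.DataFrame({"fruta": ["manzana", "pera", "platano"]})
# Hypothetical frame whose index labels are 1 and 2
# (for example, left over from an earlier filter)
stock = pd.DataFrame({"stock": [20, 25]}, index=[1, 2])

# concat pairs rows by index label, so label 0 has no stock value and gets NaN
aligned = pd.concat([frutas, stock], axis=1)
print(aligned)

# To pair rows purely by position, give both frames a fresh 0..n-1 index first
by_position = pd.concat(
    [frutas.reset_index(drop=True), stock.reset_index(drop=True)], axis=1
)
print(by_position)
```

In the first result "pera" is paired with 20, while in the second "manzana" is, which is why the row order and index matter when using concat.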

An equivalent way to write this is:

pd.merge(df_1, df_2, left_index=True, right_index=True).merge(df_3, left_index=True, right_index=True)

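The result is the same, although as you can see it is more tedious to write.

Using merge by ID

In this case we will assume there is a column with an ID that identifies each row, and that the rows are not in the same order. That is, the ID in row 1 of DataFrame 1 may be found in some row "X" of DataFrame 2. Here is the example code: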

import pandas as pd
import numpy as np

df_1 = pd.DataFrame({"id": [1,2,3,4,5],
                    "fruta": ["manzana", "pera", "platano", "naranja", "aguacate"],
                    "precio": [0.20, 0.45, 0.15, 0.12, 0.62]})

df_2 = pd.DataFrame({"id":[5,4,3,2,1],
                     "stock": [10, 20, 25, 12, 40]})
df_3 = pd.DataFrame({"id":[4,2,5,1,3],
                     "ventas_totales":[3, 5, 2, 3, 6],
                     "ingresos_ventas": [120, 110, 64,44, 147]})

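In this case we cannot use pandas.concat(). To join the DataFrames we can use the .merge() method or, in exactly the same way, the pandas.merge() function, which lets us choose the column to join on and how the join is performed: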

df_1.merge(df_2, on="id", how="left")

Output:

   id     fruta  precio  stock
0   1   manzana    0.20     40
1   2      pera    0.45     12
2   3   platano    0.15     25
3   4   naranja    0.12     20
4   5  aguacate    0.62     10

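With the on= parameter we give the name of the column to join on, and with the how= parameter we give the type of join, just as in SQL. Pandas supports the inner, left, right and outer join types.

If we want to merge() more than two DataFrames, we only need to chain the method: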
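The following sketch shows how the join types differ when the IDs do not fully overlap. The frames frutas and stock are hypothetical and deliberately share only the IDs 2 and 3.

```python
import pandas as pd

# Hypothetical frames: id 1 exists only on the left, id 4 only on the right
frutas = pd.DataFrame({"id": [1, 2, 3], "fruta": ["manzana", "pera", "platano"]})
stock = pd.DataFrame({"id": [2, 3, 4], "stock": [12, 25, 20]})

# Collect the ids that survive each type of join
ids_by_how = {}
for how in ["inner", "left", "right", "outer"]:
    result = frutas.merge(stock, on="id", how=how)
    ids_by_how[how] = sorted(result["id"].tolist())
    print(how, ids_by_how[how])
```

An inner join keeps only the IDs present in both frames, left and right keep every ID from one side, and outer keeps them all, filling the missing values with NaN.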

# join the first to the second, then join the third to the result of that join
df_1.merge(df_2, on="id", how="left").merge(df_3, on="id", how="left")

Output:

   id     fruta  precio  stock  ventas_totales  ingresos_ventas
0   1   manzana    0.20     40               3               44
1   2      pera    0.45     12               5              110
2   3   platano    0.15     25               6              147
3   4   naranja    0.12     20               3              120
4   5  aguacate    0.62     10               2               64
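When the DataFrames are held in a list, the chain can also be written as a fold. This is a sketch of one possible approach using functools.reduce from the standard library, assuming every frame has an "id" column; the list name frames is introduced here for illustration.

```python
from functools import reduce

import pandas as pd

df_1 = pd.DataFrame({"id": [1, 2, 3, 4, 5],
                     "fruta": ["manzana", "pera", "platano", "naranja", "aguacate"],
                     "precio": [0.20, 0.45, 0.15, 0.12, 0.62]})
df_2 = pd.DataFrame({"id": [5, 4, 3, 2, 1],
                     "stock": [10, 20, 25, 12, 40]})
df_3 = pd.DataFrame({"id": [4, 2, 5, 1, 3],
                     "ventas_totales": [3, 5, 2, 3, 6],
                     "ingresos_ventas": [120, 110, 64, 44, 147]})

# Hypothetical list holding every frame to be joined
frames = [df_1, df_2, df_3]

# Left-merge each frame onto the accumulated result, one after another
merged = reduce(lambda left, right: left.merge(right, on="id", how="left"), frames)
print(merged)
```

This produces the same table as the chained calls and does not need to change when more frames are added to the list.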

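Note that with merge() we can perform different types of joins, as in SQL.

With this you now know how to join DataFrames in Pandas. The official pandas documentation covers the different parameters if you want to go further.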
