Question

Asked: 2022-10-20 16:38:03 +0800 CST

How to group by multiple DataFrame columns?

Good morning, could you help me with...

I have the following DataFrame:

    fecha       Medicamento                  Dosis    ClaseServicio
0   2022-11-10  Vancomicina IV               4000.00  HOSPITALIZACION
1   2022-11-12  Vancomicina IV               2.00     HOSPITALIZACION
2   2022-11-01  Ceftriaxona IV               10.00    HOSPITALIZACION
3   2022-11-10  Ceftriaxona IV               10.00    HOSPITALIZACION
4   2022-11-10  Ertapenem IV                 20.00    HOSPITALIZACION
5   2022-11-10  Ertapenem IV                 20.00    HOSPITALIZACION
6   2022-11-10  Cefepime IV                  9.00     CUIDADO CRITICO
7   2022-11-10  Cefepime IV                  9.00     CUIDADO CRITICO
8   2022-11-10  Meropenem IV                 30.00    HOSPITALIZACION
9   2022-11-10  Meropenem IV                 15.00    CUIDADO CRITICO
10  2022-11-10  Piperacilna/tazobactam IVIV  3.00     CUIDADO CRITICO

First, it should separate the rows by the value of the "ClaseServicio" column, which is either "HOSPITALIZACION" or "CUIDADO CRITICO". Second, it should sum the "Dosis" column for each medication in the "Medicamento" column.

The goal is to obtain the following output (I understand that some of these steps may be unnecessary). Example:

This should be done for each month in the data.

I have tried in the following way:

Using the crosstab function:

pd.crosstab(df_filtro.PAV, [df_filtro.ClaseServicio], aggfunc="sum", values=df_filtro.Dosis)

pd.crosstab([df_filtro.fecha, df_filtro.PAV], [df_filtro.ClaseServicio], aggfunc="sum", values=df_filtro.Dosis)

But I don't know how to solve the last part: showing, for each month, the sum of each PAV/medication.

I'd appreciate your help. Cheers

1 Answer

HeytalePazguato · Answer 1 · 2022-10-22T13:04:08+08:00

Good day,

You can achieve this by using pandas.DataFrame.groupby together with pandas.DataFrame.unstack.

Using the following dataframe from the "sample2.csv" file as an example:

        fecha                  Medicamento   Dosis    ClaseServicio
0  2022-11-10               Vancomicina IV  4000.0  HOSPITALIZACION
1  2022-11-12               Vancomicina IV     2.0  HOSPITALIZACION
2  2022-11-01               Ceftriaxona IV    10.0  HOSPITALIZACION
3  2022-11-10               Ceftriaxona IV    10.0  HOSPITALIZACION
4  2022-11-10                 Ertapenem IV    20.0  HOSPITALIZACION
5  2022-11-10                 Ertapenem IV    20.0  HOSPITALIZACION
6  2022-11-10                  Cefepime IV     9.0  CUIDADO CRITICO
7  2022-11-10                  Cefepime IV     9.0  CUIDADO CRITICO
8  2022-11-10                 Meropenem IV    30.0  HOSPITALIZACION
9  2022-11-10                 Meropenem IV    15.0  CUIDADO CRITICO
10 2022-11-10  Piperacilna/tazobactam IVIV     3.0  CUIDADO CRITICO
11 2022-12-10               Vancomicina IV  4000.0  HOSPITALIZACION
12 2022-12-12               Vancomicina IV     2.0  HOSPITALIZACION
13 2022-12-01               Ceftriaxona IV    10.0  HOSPITALIZACION
14 2022-12-10               Ceftriaxona IV    10.0  HOSPITALIZACION
15 2022-12-10                 Ertapenem IV    20.0  HOSPITALIZACION
16 2022-12-10                 Ertapenem IV    20.0  HOSPITALIZACION
17 2022-12-10                  Cefepime IV     9.0  CUIDADO CRITICO
18 2022-12-10                  Cefepime IV     9.0  CUIDADO CRITICO
19 2022-12-10                 Meropenem IV    30.0  HOSPITALIZACION
20 2022-12-10                 Meropenem IV    15.0  CUIDADO CRITICO
21 2022-12-10  Piperacilna/tazobactam IVIV     3.0  CUIDADO CRITICO

Note: I added the same data but for December so that the result can be seen correctly.

You have to group the dataframe by "Medicamento", then by month, and then by "ClaseServicio". Since you want to display the name of the month, we use pandas.Series.dt.month_name with locale='es' to show the names in Spanish.

Note: As a prerequisite, the "fecha" column must be of type datetime64[ns].
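If the column was loaded as plain strings, it can be converted before grouping. The following is a minimal sketch, assuming a hypothetical two-row frame built inline rather than the real "sample2.csv" file:

```python
import pandas as pd

# Hypothetical two-row frame; "fecha" starts out as plain strings (object dtype).
df = pd.DataFrame({
    "fecha": ["2022-11-10", "2022-12-01"],
    "Dosis": [4000.0, 10.0],
})

# Convert the column; an explicit format avoids ambiguous day/month parsing.
df["fecha"] = pd.to_datetime(df["fecha"], format="%Y-%m-%d")

# The .dt accessor is only available once the column holds datetimes.
is_datetime = pd.api.types.is_datetime64_any_dtype(df["fecha"])
month_numbers = df["fecha"].dt.month.tolist()
print(is_datetime, month_numbers)
```

Passing parse_dates=['fecha'] to pd.read_csv, as the full example does, should have the same effect at load time.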

df.groupby(['Medicamento', df['fecha'].dt.month_name(locale='es'), 'ClaseServicio'])

Once the dataframe is grouped, you need the sum of "Dosis", so the previous line becomes:

df.groupby(['Medicamento', df['fecha'].dt.month_name(locale='es'), 'ClaseServicio'])['Dosis'].sum()

This returns the following Series, indexed by the three grouping keys:

Medicamento                  fecha      ClaseServicio  
Cefepime IV                  Diciembre  CUIDADO CRITICO      18.0
                             Noviembre  CUIDADO CRITICO      18.0
Ceftriaxona IV               Diciembre  HOSPITALIZACION      20.0
                             Noviembre  HOSPITALIZACION      20.0
Ertapenem IV                 Diciembre  HOSPITALIZACION      40.0
                             Noviembre  HOSPITALIZACION      40.0
Meropenem IV                 Diciembre  CUIDADO CRITICO      15.0
                                        HOSPITALIZACION      30.0
                             Noviembre  CUIDADO CRITICO      15.0
                                        HOSPITALIZACION      30.0
Piperacilna/tazobactam IVIV  Diciembre  CUIDADO CRITICO       3.0
                             Noviembre  CUIDADO CRITICO       3.0
Vancomicina IV               Diciembre  HOSPITALIZACION    4002.0
                             Noviembre  HOSPITALIZACION    4002.0
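To make the shape of this result easier to inspect, here is a small sketch of the same grouping on a hypothetical four-row subset of the data. The locale argument is left out (an assumption made so the sketch runs on systems without a Spanish locale installed), so the month names come out in English:

```python
import pandas as pd

# Hypothetical four-row subset of the sample data, built inline.
df = pd.DataFrame({
    "fecha": pd.to_datetime(
        ["2022-11-10", "2022-11-12", "2022-11-10", "2022-12-10"]
    ),
    "Medicamento": [
        "Vancomicina IV", "Vancomicina IV", "Meropenem IV", "Meropenem IV",
    ],
    "Dosis": [4000.0, 2.0, 30.0, 15.0],
    "ClaseServicio": [
        "HOSPITALIZACION", "HOSPITALIZACION",
        "HOSPITALIZACION", "CUIDADO CRITICO",
    ],
})

# Without a locale, month_name() returns English names ("November", ...).
month = df["fecha"].dt.month_name()

# Group by medication, month name and service class, then sum the doses.
totals = df.groupby(["Medicamento", month, "ClaseServicio"])["Dosis"].sum()

# The result is a Series whose index has one level per grouping key.
print(type(totals).__name__)
print(list(totals.index.names))
print(totals.loc[("Vancomicina IV", "November", "HOSPITALIZACION")])
```

The month level takes its name, "fecha", from the Series it was derived from.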

Note: Index levels, like columns, are numbered from 0, so "Medicamento" is level 0, "fecha" is level 1 and "ClaseServicio" is level 2.

We use unstack(1) so that the months become columns. The MultiIndex is then left with two levels, "Medicamento" and "ClaseServicio". We call unstack(1) again so that "ClaseServicio" (which is now level 1) becomes a second level of columns.
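The two unstack calls can be followed one at a time. This is a sketch that assumes a hand-built Series with the same three index levels as the grouped result, using a few of the totals from the output above:

```python
import pandas as pd

# Hand-built stand-in for the grouped totals (values copied from the output above).
index = pd.MultiIndex.from_tuples(
    [
        ("Cefepime IV", "Diciembre", "CUIDADO CRITICO"),
        ("Cefepime IV", "Noviembre", "CUIDADO CRITICO"),
        ("Meropenem IV", "Diciembre", "CUIDADO CRITICO"),
        ("Meropenem IV", "Diciembre", "HOSPITALIZACION"),
        ("Meropenem IV", "Noviembre", "CUIDADO CRITICO"),
        ("Meropenem IV", "Noviembre", "HOSPITALIZACION"),
    ],
    names=["Medicamento", "fecha", "ClaseServicio"],
)
totals = pd.Series([18.0, 18.0, 15.0, 30.0, 15.0, 30.0], index=index, name="Dosis")

# Step 1: level 1 ("fecha") moves from the index to the columns.
# The rows are now indexed by ("Medicamento", "ClaseServicio").
by_month = totals.unstack(1)

# Step 2: "ClaseServicio" is now level 1, so unstack(1) moves it under each month.
# fill_value=0 fills the cells this second unstack creates for missing pairs.
wide = by_month.unstack(1, fill_value=0)

print(by_month)
print(wide)
```

Note that fill_value is only passed to the second call, so it applies to the gaps that call introduces.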

Full example:

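import pandas as pd

# parse_dates makes "fecha" a datetime column, which the .dt accessor requires
df = pd.read_csv('sample2.csv', parse_dates=['fecha'])
# Sum "Dosis" per group, then move the month and service-class levels into the columns
print(df.groupby(['Medicamento', df['fecha'].dt.month_name(locale='es'), 'ClaseServicio'])['Dosis'].sum().unstack(1).unstack(1, fill_value=0))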

This prints the following dataframe:

fecha                             Diciembre                       Noviembre  
ClaseServicio               CUIDADO CRITICO HOSPITALIZACION CUIDADO CRITICO HOSPITALIZACION  
Medicamento                                                                   
Cefepime IV                            18.0             0.0            18.0             0.0
Ceftriaxona IV                          0.0            20.0             0.0            20.0
Ertapenem IV                            0.0            40.0             0.0            40.0
Meropenem IV                           15.0            30.0            15.0            30.0
Piperacilna/tazobactam IVIV             3.0             0.0             3.0             0.0
Vancomicina IV                          0.0          4002.0             0.0          4002.0
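As a possible alternative, a table of the same shape can be sketched with pandas.DataFrame.pivot_table. This is not the approach used above, only an illustration under some assumptions: the data is a hypothetical six-row subset built inline, the helper column name "mes" is made up for the example, and the locale is omitted so the month names are in English:

```python
import pandas as pd

# Hypothetical six-row subset of the sample data, built inline.
df = pd.DataFrame({
    "fecha": pd.to_datetime([
        "2022-11-10", "2022-11-12", "2022-11-10",
        "2022-11-10", "2022-12-10", "2022-12-10",
    ]),
    "Medicamento": [
        "Vancomicina IV", "Vancomicina IV", "Meropenem IV",
        "Meropenem IV", "Vancomicina IV", "Meropenem IV",
    ],
    "Dosis": [4000.0, 2.0, 30.0, 15.0, 4000.0, 15.0],
    "ClaseServicio": [
        "HOSPITALIZACION", "HOSPITALIZACION", "HOSPITALIZACION",
        "CUIDADO CRITICO", "HOSPITALIZACION", "CUIDADO CRITICO",
    ],
})

# "mes" is a hypothetical helper column holding the month name.
with_month = df.assign(mes=df["fecha"].dt.month_name())

# One row per medication, one column per (month, service class) pair.
wide = with_month.pivot_table(
    index="Medicamento",
    columns=["mes", "ClaseServicio"],
    values="Dosis",
    aggfunc="sum",
    fill_value=0,
)
print(wide)
```

Either way, the month columns are sorted alphabetically by name, not chronologically, which is why "Diciembre" appears before "Noviembre" in the output above.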
