What is a promise in Javascript?

Question

Asked: 2020-04-26 08:51:19 +0800 CST 2020-04-26 08:51:19 +0800 CST 2020-04-26 08:51:19 +0800 CST

how to apply a for loop with filters and double filters

772

I have a df with the rating, title and gender of those who have rated a movie (there is much more data, the movies are repeated because there are many user ratings, I have simplified it to make it more visible)

df


    Rating  Title                                   Gender
0   5       One Flew Over the Cuckoo's Nest (1975)  F
1   3       James and the Giant Peach (1996)        M
2   3       My Fair Lady (1964)                     F
3   4       Erin Brockovich (2000)                  F
4   5       Bug's Life, A (1998)                    M
5   3       One Flew Over the Cuckoo's Nest (1975)  M
... ...     ...                                     ...

I must make a function that returns a df with the average user ratings per movie and separated by gender, that is, it should be something like this (with df structure or not, like this example):

media_valoraciones():
    "Calcula la puntuación media  de cada película por sexo del usuario"

        media_por_sexo                               Media_mujer    Media_hombre

        titulo 

        One Flew Over the Cuckoo's Nest (1975)       3.375000       2.761905
        James and the Giant Peach (1996)             3.388889       3.352941
        My Fair Lady (1964)                          2.675676       2.733333
        ...                                          ...            ...

I'm trying to make loops with forthis style:

for i in df.Title.unique():

    df[df.Title == i].Rating.sum()/len(df[df.Title == i])

I would read it as: for each element of the unique values of the titles, make a filter in which for each one, you add the ratings and divide it by the number of ratings, it returns nothing. I don't know if I'm going the right way.

I should also do the gender filter, for this I intend to do the gender filter first and then apply the above loop, for the gender filter I have no problem.

1 Answers

Voted

FJSevilla · Answer 1 · 2020-04-26T09:58:43+08:00

You don't need a loop, use pandas.DataFrame.groupby, group by title and genre, apply the mean to each group and unstack to get the multiindex to columns:

medias = (df.groupby(by=["Title", "Gender"])["Rating"]
            .mean()
            .unstack()
            .rename({'F': 'Media_mujer', 'M': 'Media_hombre'}, axis=1)
            )

A complete example:

import io
import pandas as pd

data = io.StringIO("""\
Rating;Title;Gender
5;One Flew Over the Cuckoo's Nest (1975);F
3;James and the Giant Peach (1996);M
3;My Fair Lady (1964);F
4;Erin Brockovich (2000);F
5;Bug's Life, A (1998);M
6;One Flew Over the Cuckoo's Nest (1975);M
3;One Flew Over the Cuckoo's Nest (1975);F
4;James and the Giant Peach (1996);M
7;My Fair Lady (1964);F
9;Erin Brockovich (2000);M
2;Bug's Life, A (1998);F
6;One Flew Over the Cuckoo's Nest (1975);F
4;One Flew Over the Cuckoo's Nest (1975);M
3;James and the Giant Peach (1996);M
2;My Fair Lady (1964);M
6;Erin Brockovich (2000);M
8;Bug's Life, A (1998);M
5;One Flew Over the Cuckoo's Nest (1975);F
""")


df = pd.read_csv(data, sep=";")


medias = (df.groupby(by=["Title", "Gender"])["Rating"]
            .mean()
            .unstack()
            .rename({'F': 'Media_mujer', 'M': 'Media_hombre'}, axis=1)
            )

>>> medias

Gender                                 Media_mujer    Media_hombre
Title     
Bug's Life, A (1998)                         2.00         6.500000
Erin Brockovich (2000)                       4.00         7.500000
James and the Giant Peach (1996)              NaN         3.333333
My Fair Lady (1964)                           5.00        2.000000
One Flew Over the Cuckoo's Nest (1975)        4.75        5.000000

how to apply a for loop with filters and double filters

HTML button that sends you to another page

Why do I get the error "Call to undefined function mysql_connect()"?

How to create an HTML button that works as a link?

How to separate a String in Java. How to use split()

Filter by dates in sql server

How to limit the number of decimal places in a double?

For each in JavaScript?

Position footer ALWAYS glued to the footer

Definitive Guide to Type Conversion in Java

How to properly compare Strings (and objects) in Java?