David GG's Questions

Asked: 2020-05-01 09:39:19 +0800 CST

Assign colors to a confusion matrix in Python

I have an 18x18 confusion matrix. For better error visualization I want to colormap based on the value we have in our normalized array.

I want to make a color gradation in which to visualize the data more adequately. For example, the zones in which the correct answers are greater than 80%, would have a different color scale than the zones in which the correct answers are between 20% and 80%. Finally, the areas where the hits are less than 20%, would have a different color gradation.

I have the following code (where we make a dataframe and with it our confusion matrix).

import os
import numpy as np
import pandas as pd
import seaborn as sn
    
working_path = os.getcwd() #sirve para establecer en qué carpeta estamos trabajando (ruta), ahora todos los archivos q se encuentren en esa carpeta solo los tenemos que llamar con su nombre
    
df = pd.read_csv("salida.txt",delimiter="\t") #Hacemos un dataframe, importando el archivo txt separado por tabuladores
    
df.rename(columns={'Number of Syllables': 'NSyllables'}, inplace = True) #Cambiamos (acortamos) nombres de la columna que indica el nº de sílabas

conf_matC = pd.crosstab(df['TargetC'], df['RespC'], rownames=['Target'], colnames=['Response'], margins = True); #Matriz de confusión para CONSONANTES
ncmC = conf_matC.drop(["All", "**"], axis = 0); #Quitamos la fila All (no da información relevante) Consonantes
confusion_matrixC = ncmC.drop(["All", "**","gr", "zr"], axis = 1); #Matriz de confusión total Consonantes - Quitamos la columna All (no nos da información) Consonantes
ncmC1 = confusion_matrixC/confusion_matrixC.max().astype(np.float64); #Normalizamos matriz confusión Consonantes
normarlize_confusion_matrixC = ncmC1.round(2); #Redondeamos los datos a dos decimales en df Consonantes

printed_matrixC = sn.heatmap(normarlize_confusion_matrixC, cmap='Oranges', annot=False); #Imprimimos matriz de confusion con mapa de calor, Consonantes

With this code, we get the following array:

Response     b   ch     d    f     g     k  ...    rr     s     t    x    y     z
Target                                      ...                                  
b         1.00  0.0  0.10  0.0  0.08  0.00  ...  0.00  0.00  0.00  0.0  0.0  0.00
ch        0.00  1.0  0.00  0.0  0.00  0.00  ...  0.00  0.00  0.00  0.0  0.0  0.00
d         0.00  0.0  1.00  0.0  0.08  0.00  ...  0.08  0.00  0.02  0.0  0.0  0.00
f         0.00  0.0  0.00  1.0  0.00  0.00  ...  0.00  0.00  0.00  0.0  0.0  0.36
g         0.03  0.0  0.00  0.0  1.00  0.00  ...  0.00  0.00  0.00  0.0  0.0  0.00
k         0.00  0.0  0.00  0.0  0.00  1.00  ...  0.00  0.00  0.00  0.0  0.0  0.00
l         0.00  0.0  0.03  0.0  0.00  0.00  ...  0.00  0.00  0.00  0.0  0.0  0.00
m         0.03  0.0  0.00  0.0  0.08  0.00  ...  0.08  0.00  0.00  0.0  0.0  0.00
n         0.00  0.0  0.10  0.0  0.00  0.00  ...  0.08  0.00  0.00  0.0  0.0  0.00
ny        0.00  0.0  0.00  0.0  0.00  0.00  ...  0.00  0.00  0.00  0.0  0.0  0.00
p         0.00  0.0  0.00  0.0  0.00  0.00  ...  0.00  0.00  0.04  0.0  0.0  0.00
r         0.00  0.0  0.00  0.0  0.00  0.00  ...  0.08  0.00  0.02  0.0  0.0  0.00
rr        0.00  0.0  0.00  0.0  0.00  0.00  ...  1.00  0.00  0.00  0.0  0.0  0.00
s         0.00  0.0  0.00  0.0  0.00  0.00  ...  0.00  1.00  0.00  0.0  0.0  0.00
t         0.00  0.0  0.00  0.0  0.00  0.09  ...  0.00  0.00  1.00  0.0  0.0  0.12
x         0.00  0.0  0.00  0.0  0.00  0.00  ...  0.00  0.00  0.00  1.0  0.0  0.04
y         0.00  0.0  0.00  0.0  0.00  0.00  ...  0.00  0.00  0.00  0.0  1.0  0.00
z         0.00  0.0  0.00  0.0  0.00  0.00  ...  0.00  0.12  0.00  0.0  0.0  1.00

Seen as an image in the dataframe:

At first I have made a heatmap of my confusion matrix, but it gives me several display errors as shown in the following image.

How could I solve this problem and print the matrix by colors without getting the numbers cut off? Could you make different color gradations according to the degree of success as explained above?

Thank you very much

David GG

Asked: 2020-04-25 16:44:32 +0800 CST

How do I convert a large confusion matrix to a 2x2 matrix in python? All this being both a dataframe

I have a dataframe with a confusion matrix (5x5) with the following data:

I would like to convert this (5x5) matrix into 5 (2x2) confusion matrices, one for each of the letters a,e,i,o,u). For example, for the letter "a", it would have in position [1,1] it would have the times that both the prediction and the result were "a" (correct). At position [2,1], you would have the times that the result is not "a", but the program has predicted that it is (error). At position [1,2], it would have the times that the result is "a", but the program has not recognized a (error). In position [2,2], it would have the times that neither the prediction nor the result has been "a", that is, the rest of the cases.

Something like what you see in the attached image.

To get to the confusion matrix of the first image, I have made this code:

import os
import numpy as np
import matplotlib.pyplot as plt
import pandas as pd
import seaborn as sn
import matplotlib.pyplot as plt
from sklearn.metrics import confusion_matrix

working_path = os.getcwd() #sirve para establecer en qué carpeta estamos trabajando (ruta), ahora todos los archivos q se encuentren en esa carpeta solo los tenemos que llamar con su nombre

df = pd.read_csv("salida.txt",delimiter="\t") #Hacemos un dataframe, importando el archivo txt separado por tabuladores

df.rename(columns={'Number of Syllables': 'NSyllables'}, inplace = True) #Cambiamos (acortamos) nombres de la columna que indica el nº de sílabas

confusion_matrixV = pd.crosstab(df['TargetV'], df['RespV'], rownames=['Target'], colnames=['Response'], margins = True); #Matriz de confusión para VOCALES

enter the code here

I don't know how I could create the other 2x2 from this dataframe, I have supposed that through a for loop that it would start like this, but I don't know how to do it:

for index, row in confusion_matrixV.iterrows():

Assign colors to a confusion matrix in Python

How do I convert a large confusion matrix to a 2x2 matrix in python? All this being both a dataframe

HTML button that sends you to another page

Why do I get the error "Call to undefined function mysql_connect()"?

How to create an HTML button that works as a link?

How to separate a String in Java. How to use split()

Filter by dates in sql server

How to limit the number of decimal places in a double?

For each in JavaScript?

Position footer ALWAYS glued to the footer

Definitive Guide to Type Conversion in Java

How to properly compare Strings (and objects) in Java?

David GG's questions