Question

Asked: 2021-11-10 19:52:20 +0800 CST

Plot of a Clustering

I have a vector column called pca, which I split into its components, x and y, in the following way:

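from pyspark.ml.linalg import Vectors

# Keep only the id and the PCA vector column.
var = transformed.select('customer_id', 'pca')

def extract(row):
    # Flatten the vector into one tuple: (customer_id, component_1, component_2, ...)
    return (row.customer_id, ) + tuple(row.pca.toArray().tolist())

# Only the first column is named here; the remaining ones get the default names _2, _3, ...
var_a = var.rdd.map(extract).toDF(["customer_id"])
var_a = var_a.withColumnRenamed("_2", "x")
var_a = var_a.withColumnRenamed("_3", "y")
var_a.show()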

This gives me the following result:

Then I separate x and y as follows:

x = var_a.select("x")
y = var_a.select("y")

I do this so that I can make a scatter plot of the two variables; my attempt is shown in the following code. It is worth mentioning that the prediction column I refer to only contains values from 0 to 6, hence the seven colors assigned to differentiate the clusters.

df = predictions_pca.select('prediction').toPandas()
colores=['red','green','blue','yellow','fuchsia','black','purple']
asignar=[]
for row in df:
    asignar.append(colores[int(row)])
        
plt.scatter(x, y, c=asignar, s=1)
plt.xlabel('Var_1')
plt.ylabel('Var_2')
plt.title('K-Means Clustering')
plt.show()

However, I get an error when I run this code. Could someone guide me or tell me what I'm doing wrong?

I attach the traceback of the error I get:

1 Answer

Ricardo J. Martínez Suástegui · Answer 1 · 2021-11-12T19:25:00+08:00

I was finally able to solve it. What I was missing was a collect() call and converting the variables to lists, as follows:

x = list(var_a.select('x').collect())
y = list(var_a.select('y').collect())
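
A note on what these two lines produce: collect() already returns a list, and each element is a Row holding a single field rather than a plain number. That worked for plt.scatter here, but flattening the rows first gives ordinary lists of floats. Below is a minimal sketch of that flattening step. The namedtuple is only a stand-in for pyspark's Row (which is also a tuple subclass), and flatten_column is a hypothetical helper name, not part of the original code.

```python
from collections import namedtuple

# Stand-in for pyspark.sql.Row (assumption for illustration; a real Row is
# also a tuple subclass, so positional access works the same way).
Row = namedtuple("Row", ["x"])


def flatten_column(rows):
    """Return the first field of each collected row as a plain list."""
    return [row[0] for row in rows]


# Same shape as the result of var_a.select('x').collect()
collected = [Row(0.5), Row(-1.25), Row(3.0)]
x = flatten_column(collected)
print(x)
```

With the real DataFrame this would presumably be written as flatten_column(var_a.select('x').collect()), and likewise for y.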

Then, with practically the same code for the plot:

import matplotlib.pyplot as plt

df = predictions_pca.toPandas()
colores = ['red', 'green', 'blue', 'yellow', 'fuchsia', 'cyan', 'purple']
# Map each row's cluster label (0 to 6) to its color.
asignar = []
for row in range(len(df)):
    asignar.append(colores[df['prediction'][row]])
plt.figure(figsize=(16, 9))
plt.scatter(x, y, c=asignar)
# cent holds the cluster centroids (defined elsewhere; not shown in this answer).
plt.scatter(cent[:, 0], cent[:, 1], marker='*', c='black')  # Mark the centroids.
plt.xlabel('X')
plt.ylabel('Y')
plt.title('K-Means Clustering')
plt.show()
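
The color assignment in the loop above can also be written as a small function that takes the labels directly. This is only a sketch under assumptions: colors_for and PALETTE are hypothetical names, and the range check is an addition that is not in the original code.

```python
# Same seven colors as in the answer, one per cluster label 0..6.
PALETTE = ['red', 'green', 'blue', 'yellow', 'fuchsia', 'cyan', 'purple']


def colors_for(labels, palette=PALETTE):
    """Map each integer cluster label to a color name from the palette."""
    colors = []
    for label in labels:
        idx = int(label)
        # Fail with a clear message instead of an IndexError (or a silent
        # wrap-around for negative labels).
        if not 0 <= idx < len(palette):
            raise ValueError(
                f"cluster label {idx} has no color in a palette of {len(palette)}"
            )
        colors.append(palette[idx])
    return colors


asignar = colors_for([0, 6, 2, 2])
print(asignar)
```

With the DataFrame from the answer, this would presumably be called as colors_for(df['prediction']), since iterating a pandas Series yields its values. By contrast, iterating the DataFrame itself, as the loop in the question does with for row in df, yields the column labels rather than the rows, which may be part of what went wrong there.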

I got the expected output:

I hope someone else finds it helpful. Greetings!
