why y.size is 65536 but y size in new dataset is 1000

classic Classic list List threaded Threaded
1 message Options
Reply | Threaded
Open this post in threaded view
|

why y.size is 65536 but y size in new dataset is 1000

hagersaleh
please help me, code write in spark by python
error is

Caused by: java.lang.IllegalArgumentException: requirement failed:
BLAS.dot(x: Vector, y:Vector) was given Vectors with non-matching sizes:
x.size = 1000, y.size = 65536

why y.size is 65536 but y size in new dataset is 1000

1-I train model on spark using sentiment140 dataset and LogisticRegression
2- I save model on computer
lrModel.save("hager")
3- when I load model to test new data set
from pyspark.ml.classification import LogisticRegressionModel
loadmodel=LogisticRegressionModel.load("hager")
4-when I calculate accuracy
predictions=loadmodel.transform(df)
from pyspark.ml.evaluation import BinaryClassificationEvaluator
evaluator = BinaryClassificationEvaluator(rawPredictionCol="rawPrediction")
evaluator.evaluate(predictions)

#accurcy
accuracy = predictions.filter(predictions.label ==
predictions.prediction).count() / float(val_set.count())
accuracy



--
Sent from: http://apache-spark-developers-list.1001551.n3.nabble.com/

---------------------------------------------------------------------
To unsubscribe e-mail: [hidden email]