Spark Machine Learning 6: Clustering Models (spark-shell)
K-means clustering. Goal: minimize the sum of the within-cluster variances over all clusters
- Within-cluster sum of squares (WCSS, within-cluster sum of squared errors)
- fuzzy K-means
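As a sketch of the objective named above, WCSS can be written as follows. The symbols are assumptions introduced here for illustration and do not appear in the original notes: K is the number of clusters, C_k is the set of points assigned to cluster k, and \mu_k is the centroid (mean) of that cluster.

$$
\mathrm{WCSS} = \sum_{k=1}^{K} \sum_{x \in C_k} \lVert x - \mu_k \rVert^{2},
\qquad
\mu_k = \frac{1}{\lvert C_k \rvert} \sum_{x \in C_k} x
$$

Standard K-means assigns each point to exactly one cluster, so each point contributes a single squared distance to this sum. Fuzzy K-means instead gives each point a degree of membership in every cluster.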
Hierarchical clustering
- Agglomerative clustering
- Divisive clustering
0 Runtime environment
cd $SPARK_HOME
bin/spark-shell --name my_mlib --packages org.jblas:jblas:1.2.4 --driver-memory 4G --executor-memory 4G --driver-cores 2
import org.apache.spark.mllib.recommendati
