Spark (Python): An Example of Creating an RDD from In-Memory Data
An example of creating an RDD from in-memory data in Spark (Python):

myData = ["Alice","Carlos","Frank","Barbara"]
myRdd = sc.parallelize(myData)
myRdd.take(2)

----

In [52]: myData = ["Alice","Carlos","Frank","Barbara"]
In [53]: myRdd = sc.parallelize(myData)
In [54]: myRdd.take(2)
17/09/24 02:40:10 INFO spark.SparkContext: Starting job: runJob at PythonRDD.scala:393
17/09/24 02:40:10 INFO scheduler.DAGScheduler: Got job 5 (runJob at PythonRDD.scala:393) with 1 output partitions
17/09/24 02:40:10 INFO scheduler.DAGScheduler: Final st...
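The log line "with 1 output partitions" suggests that take(2) only had to read one partition to find two elements. The following pure-Python sketch is a rough model of that behavior and not Spark's implementation: the helper names slice_into_partitions and take are hypothetical, the contiguous slicing rule is a simplification (PySpark's actual placement also depends on batching), and real Spark may scan several partitions per round instead of one at a time.

```python
def slice_into_partitions(data, num_slices):
    """Split data into num_slices contiguous chunks (simplified model of parallelize)."""
    n = len(data)
    return [
        data[(i * n) // num_slices:((i + 1) * n) // num_slices]
        for i in range(num_slices)
    ]


def take(partitions, num):
    """Collect the first num elements in order, stopping as soon as enough are found.

    Returns the collected elements and the number of partitions that were read.
    """
    taken = []
    scanned = 0
    for part in partitions:
        if len(taken) >= num:
            break  # enough elements already; later partitions are never read
        scanned += 1
        taken.extend(part[:num - len(taken)])
    return taken, scanned


my_data = ["Alice", "Carlos", "Frank", "Barbara"]
partitions = slice_into_partitions(my_data, 2)  # assumption: 2 partitions
result, scanned = take(partitions, 2)

print(partitions)  # [['Alice', 'Carlos'], ['Frank', 'Barbara']]
print(result)      # ['Alice', 'Carlos']
print(scanned)     # 1
```

In this model the second partition is never touched, which is why take is usually much cheaper than collect() when only a quick look at the first few elements is needed.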