[Spark][Python]groupByKey例子
Spark Python 索引页 [Spark][Python]sortByKey 例子的继续: [Spark][Python]groupByKey例子 In [29]: mydata003.collect() Out[29]: [[u'00001', u'sku933'], [u'00001', u'sku022'], [u'00001', u'sku912'], [u'00001', u'sku331'], [u'00002', u'sku010'], [u'00003', u'sku888'], [u'00004', u'sku411']] In [30]: mydata005=mydata003.groupByKey() In [32]: mydata005.count() Out[32]: 4 In [33]: mydata005.collect() Out[33]: [(u'00004', <pyspark.resultiterable.ResultIterable at 0x7fcebe436b10>), (u'00001', <pyspark.resu...