以åæ»æ¯åä¸æ¸
æ¥Sparkä¸flatmapåmapçåºå«ï¼ç°å¨å¼æç½äºï¼æ»ç»å享ç»å¤§å®¶ï¼å
ççflatmapåmapçå®ä¹ã
map()æ¯å°å½æ°ç¨äºRDDä¸çæ¯ä¸ªå
ç´ ï¼å°è¿åå¼æææ°çRDDã
flatmap()æ¯å°å½æ°åºç¨äºRDDä¸çæ¯ä¸ªå
ç´ ï¼å°è¿åçè¿ä»£å¨çææå
容æææ°çRDD
æäºæå£ï¼ççä¾åå°±æç½äºã
val rdd = sc.parallelize(List("coffee panda","happy panda","happiest panda party"))
è¾å
¥
rdd.map(x=>x).collect
ç»æ
res9: Array[String] = Array(coffee panda, happy panda, happiest panda party)
è¾å
¥
rdd.flatMap(x=>x.split(" ")).collect
ç»æ:
res8: Array[String] = Array(coffee, panda, happy, panda, happiest, panda, party)
flatMap说æç½å°±æ¯å
mapç¶ååflatï¼åæ¥ç个ä¾å
val rdd1 = sc.parallelize(List(1,2,3,3))
scala> rdd1.map(x=>x+1).collect
res10: Array[Int] = Array(2, 3, 4, 4)
scala> rdd1.flatMap(x=>x.to(3)).collect
res11: Array[Int] = Array(1, 2, 3, 2, 3, 3, 3)
è¿ä¸åºè¯¥å®å
¨æç½äºå§ï¼ä¸æç»æçè¨ï¼æ¬¢è¿ææ£ã
温馨提示:内容为网友见解,仅供参考