数据如下[['my', 'name', 'is', 'lala'],['what', 'is', 'your','name']]
我们希望它[('my',1),('name',1),...]
使用python RDD和lambda函数
x = [(w,1) for l in data for w in l]
# [('my', 1), ('name', 1), ('is', 1), ('lala', 1), ('what', 1), ('is', 1), ('your', 1), ('name', 1)]