我有一个time
列,时间戳的形式为2018-04-12 06:48:39
.如何从此时间戳添加列Month
,在本例中包含 4 ?
pyspark.sql.functions.month
:
import pyspark.sql.functions as F
df.withColumn('month', F.month('time')).show()
+-------------------+-----+
| time|month|
+-------------------+-----+
|2018-04-12 06:48:39| 4|
+-------------------+-----+