In this Post, We will learn to get the current date in pyspark with example
Getting current date
Following lines help to get the current date and time .
import findspark
from pyspark.sql import Row
from pyspark import SparkContext , SparkConf
import datetime
now = datetime.datetime.now()
#Getting Current date and time
print (now.strftime("%Y-%m-%d %H:%M:%S"))
Output
Getting current date and current timestamp within dataframe
current_date() helps to get the current date and current_timestamp() used to get the timestamp .
import findspark
findspark.init()
from pyspark import SparkContext,SparkConf
from pyspark.sql import Row
from pyspark.sql.functions import *
sc=SparkContext.getOrCreate()
#creating dataframe with three records
df=sc.parallelize([Row(name='Gokul',Class=10,marks=480,grade='A')]).toDF()
print("Printing df dataframe below ")
df.show()
#Getting current date and timestamp
ddf.withColumn("currentdt",current_date()).withColumn("timestamp",current_timestamp()).show()
Output
Printing df dataframe below
+-----+-----+-----+-----+
|Class|grade|marks| name|
+-----+-----+-----+-----+
| 10| A| 480|Gokul|
+-----+-----+-----+-----+
+-----+-----+-----+-----+----------+--------------------+
|Class|grade|marks| name| currentdt| timestamp|
+-----+-----+-----+-----+----------+--------------------+
| 10| A| 480|Gokul|2020-02-27|2020-02-27 21:45:...|
+-----+-----+-----+-----+----------+--------------------+
Reference
http://spark.apache.org/docs/latest/api/python/pyspark.sql.html?highlight=date
Related Articles
renaming dataframe column in pyspark