I am new to Python (coming from R), and I am trying to understand how I can convert a timestamp series in a pandas dataframe (in my case this is called df['timestamp']) into what I would call a string vector in R. Is this possible? How would this be done?

I tried df['timestamp'].apply('str'), but this seems to simply put the entire column df['timestamp'] into one long string. I'm looking to convert each element into a string and preserve the structure, so that it's still a vector (or maybe this is called an array?).

Best Answer


Consider the dataframe df

import pandas as pd

df = pd.DataFrame(dict(timestamp=pd.to_datetime(['2000-01-01'])))
df

   timestamp
0 2000-01-01

Use the datetime accessor dt to access the strftime method. You can pass a format string to strftime and it will return a formatted string. When used with the dt accessor you will get a series of strings.

df.timestamp.dt.strftime('%Y-%m-%d')

0    2000-01-01
Name: timestamp, dtype: object

Visit strftime.org for a handy set of format strings.
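As a rough illustration of how the format string controls the result, here is a small sketch. The two-row dataframe and the variable names (as_date, as_full, date_list) are made up for this example and are not from the question.

```python
import pandas as pd

# Hypothetical sample data, two timestamps with a time component.
df = pd.DataFrame(
    {"timestamp": pd.to_datetime(["2000-01-01 13:45:00", "2021-12-31 08:05:09"])}
)

# Each element is formatted on its own; the result is a Series of strings
# with the same index as the original column.
as_date = df["timestamp"].dt.strftime("%Y-%m-%d")
as_full = df["timestamp"].dt.strftime("%d/%m/%Y %H:%M")

# A plain Python list is probably the closest thing to an R character vector.
date_list = as_date.tolist()

print(as_date)
print(as_full)
print(date_list)
```

If you only need the strings and not the pandas index, tolist() (or to_numpy() for a NumPy array) drops the Series wrapper.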

Use astype

>>> import pandas as pd
>>> df = pd.to_datetime(pd.Series(['Jul 31, 2009', '2010-01-10', None]))
>>> df.astype(str)
0    2009-07-31
1    2010-01-10
2           NaT
dtype: object

This returns a Series of strings, one per element of the original.
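One thing that may be worth checking is how missing timestamps come out. The sketch below uses made-up data and names (ts, formatted, filled, cast), and the empty-string placeholder is just an assumption about what you might want. As far as I know, dt.strftime leaves a missing entry as a missing value, while how astype(str) renders it can depend on your pandas version, so inspect the output before relying on it.

```python
import pandas as pd

# Hypothetical series with one valid timestamp and one missing value (NaT).
ts = pd.Series(pd.to_datetime(["2009-07-31", None]))

# With dt.strftime the missing entry stays missing rather than becoming text.
formatted = ts.dt.strftime("%Y-%m-%d")

# If every element has to be a real string, pick a placeholder explicitly.
# The empty string here is only an illustrative choice.
filled = formatted.fillna("")

# astype(str) converts the valid entries the same way; check what your
# pandas version does with the missing one.
cast = ts.astype(str)

print(formatted)
print(filled)
print(cast)
```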

Following on from VinceP's answer, to convert a datetime Series in-place do the following:

df['Column_name'] = df['Column_name'].astype(str)
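To confirm that the assignment really replaced the column, something like the following should work. The dataframe contents are invented for the example; only the column name follows the line above.

```python
import pandas as pd

# Hypothetical dataframe with a datetime column.
df = pd.DataFrame({"Column_name": pd.to_datetime(["2000-01-01", "2000-01-02"])})

# Remember the dtype before the conversion for comparison.
before = df["Column_name"].dtype

# Overwrite the column with its string version.
df["Column_name"] = df["Column_name"].astype(str)

# Every element should now be a Python str, and the row count is unchanged.
all_str = all(isinstance(v, str) for v in df["Column_name"])

print(before)
print(df["Column_name"].tolist())
print(all_str)
```

Note that this overwrites the datetime values, so keep a copy of the original column if you still need date arithmetic later.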