Skip to content Skip to sidebar Skip to footer

Python - How To Split Column Values Using Multiple Separators

I am reading a .csv file and creating a Panda Dataframe. From this Dataframe I am fetching a value which is supposed to be a 'list' item with comma separated values in it. But it c

Solution 1:

This is a common issue with CSVs. Fortunately, you can nip this in the bud, simply by reading your CSV properly, so you don't have to do all this unnecessary post-processing later.

When reading your dataframe with read_csv, pass a regex to sep\ delimiter -

df = pd.read_csv(..., sep='\s*,\s*', engine='python')

Now, df.columns should be a list of strings.

Solution 2:

You can make use of skipinitialspace=True parameter:

df = pd.read_csv(filename, sep=',', skipinitialspace=True)

skipinitialspace : boolean, default False

Skip spaces after delimiter.

NOTE: this parameter takes care only of spaces after delimiter, so @cᴏʟᴅsᴘᴇᴇᴅ's answer is more generic.

Solution 3:

import re

column_names = "First_Name , Last_Name,Middle_Name"
l = re.compile("\s*,\s*").split(column_names)
print(l)

Post a Comment for "Python - How To Split Column Values Using Multiple Separators"