Sometimes, your data comes ith several bits of information in one column. Like a column with US states in the format
US-TX. Or a column with companies and the product they sell:
Buy say you want contry (
US) separate from state (
TX), e.g. to create a Datawrapper choropleth map. Good thing there are easy ways to separate data points into two or more columns.
I’ll show two ways to create several new columns out of one column. To do so, we’ll use Google Sheets – but this should work with LibreOffice Calc, Excel or any other spreadsheet software.
The first method is the formula
Split columns with SPLIT()
- Create at least two columns next to the column with the data you want to split. You can do so, click on the header (
C, etc.). Then click the little triangle and select “Insert 1 right”. Repeat to create a second free column.
- In the first free column, write
B1being the cell you want to split and
-the character you want the cell to split on. (If you see the error
#REF!in your cell, you’ll need to create more columns.)
- To apply the changes to the cells below, drag down the blue square in the bottom right of the selected cell(s). Double-click on the blue square to fill all remaining cells.
Extract content from columns with LEFT()
Sometimes you don’t have clear separator characters, but just want to extract the first or last characters of a cell. To do so, use the formulas
- Insert a new column. (Or two. Or three! As many as you need.)
- In the new column(s), write
=LEFT(B1,2)to extract the first 2 characters of the cell B1.
=RIGHT(B1,8)to extract the last 8 characters of the cell B1.
=MID(B1,4,2)to extract the 2 characters following the 4th character in B1.
- To apply the changes to the cells below, drag down the blue square.
Pro tip 1: You can combine formulas to extract characters at all sorts of crazy positions. For example, the formula
=LEN() gives back the number of characters in a cell. So
=LEFT(A1,LEN(A1)-2) extracts the entire text in a cell except the last two characters. To separate the cell
Datawrapper (Software) into the two cells
Software, you could use the formula
=SPLIT(LEFT(A5,LEN(A5)-1),"(". This formula first removes the last bracket and then splits the remaining cell content on
Pro tip 2: Now that you learned to separate text, you can also bring it together again. To combine the column
US from your cell A1 and
TX from B1 with a hyphen, use ampersands and write
Pro tip 3: You can also extract content with
MID() not just from text cells, but also from number and date cells. If you want to apply formulas like
LEFT() to your dates, it helps to transform them to the text format first. To do so, use the formula
=TEXT(A1, "MM/DD/YYYY"). Instead of
MM/DD/YYYY, you can use any combination of these date codes and
-, a space, etc. For example,
=TEXT(A1, "dd-mmm-yyyy") will transform the date format
1st of November 2019 to a text cell with the content
I hope this was helpful! If you need more help cleaning your data to prepare it for a charting tool like Datawrapper, visit our article “How to prepare your data for analysis and charting in Excel & Google Sheets”. And if you have any questions, please leave a comment or write to me at email@example.com.