Remove all duplicates from 2 columns in spreadsheet - excel

I have two columns in a spreadsheet, each with different values. I need to compare the two columns and remove the duplicates from each. The columns must stay separate from each other, so I can't remove the duplicates and mash the two columns into one. How do I remove the duplicates while keeping the columns separate?

The simplest way to do this is to add a helper column next to each of your data columns and put the formulas there.
Say that you have the data in columns A and D.
In column B you can write:
=IF(IFERROR(MATCH(A1;D:D;0);FALSE);A1;"")
Copy this down to all rows below. In column E you can write:
=IF(IFERROR(MATCH(D1;A:A;0);FALSE);D1;"")
Copy this down to all rows in E as well.
Each formula writes out the value if it also exists in the other column and leaves the cell empty otherwise. (The semicolons are argument separators; if your Excel locale uses commas, replace them.)
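To check the logic outside Excel, the same cross-column test can be sketched in Python. This is only a minimal sketch, assuming the two columns have already been read into plain lists; the function name `mark_shared` and the sample values are hypothetical. Note that this comparison is case-sensitive, whereas MATCH ignores case for text.

```python
def mark_shared(col_a, col_d):
    """Mimic helper columns B and E: keep a value if it also occurs
    in the other column, otherwise return an empty string."""
    set_a, set_d = set(col_a), set(col_d)
    helper_b = [v if v in set_d else "" for v in col_a]
    helper_e = [v if v in set_a else "" for v in col_d]
    return helper_b, helper_e


# Hypothetical sample data standing in for columns A and D.
col_a = ["apple", "pear", "plum"]
col_d = ["pear", "kiwi", "apple"]
helper_b, helper_e = mark_shared(col_a, col_d)
print(helper_b)
print(helper_e)
```

The non-empty entries in each helper list are the values shared by both columns, so each column can then be filtered on its own without merging the two.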

Related

How to remove duplicated rows in excel when there is certain matching values

I want to delete all the rows that contain duplicates in specific columns.
So for example, if rows 1 and 2 both have the same values in columns B and E, then I want the duplicate row to be deleted.
Does that make sense? English isn't my first language, so it's hard to put into words what I want. Thank you all.
For example, only the first row should be deleted, despite the same value appearing 3 times in "numero du certificat..."

How to merge two rows if two cols contain same value

I want to merge Excel rows by comparing the values of two columns: Column A and Column B.
If the values of Col A and Col B are the same, the row should be merged with the other rows (there can be multiple) that have the same values in Col A and Col B.
The values in the third column should be appended to the merged row, separated by commas.
Please refer to this image for more clarity; it is easier to explain with an example:
As per my screenshot below, try:
E2=UNIQUE(A2:B4)
G2=TEXTJOIN(", ",TRUE,IF($A$2:$A$4&$B$2:$B$4=E2&F2,$C$2:$C$4,""))
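The same grouping idea (unique A/B pairs, then the matching C values joined with ", ") can be sketched in Python. This is a rough illustration, assuming each row is an (A, B, C) tuple; `merge_rows` and the sample rows are hypothetical names and data. Empty C values are skipped, mirroring the TRUE (ignore empty) argument of TEXTJOIN.

```python
def merge_rows(rows):
    """Group rows by their first two values and join the third values
    with ", ", keeping groups in order of first appearance."""
    merged = {}
    for a, b, c in rows:
        values = merged.setdefault((a, b), [])
        if c not in ("", None):  # skip empty cells, like TEXTJOIN(..., TRUE, ...)
            values.append(str(c))
    return [(a, b, ", ".join(cs)) for (a, b), cs in merged.items()]


# Hypothetical sample rows standing in for A2:C4.
rows = [("x", 1, "red"), ("y", 2, "blue"), ("x", 1, "green")]
result = merge_rows(rows)
print(result)
```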

How to split Excel row into multiple based on value in cell

I would like to split the rows out in the first table based on comma separated values in Column C.
So columns A,B,D,E would be duplicated for each comma separated value in column C.
So the data would result in the table below.
Thanks
Here's an algorithmic approach you can try:
- Get a reference to your data range
- Loop over the rows in that range
  - Since you're going to be adding rows, loop bottom to top
- For each row
  - Split the value in column C
  - If the resulting array has > 1 values, insert rows for the new values
  - Copy Values into the new rows
  - Transpose the Split array into column C
- Repeat
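The steps above can be sketched in Python to make the bottom-to-top loop concrete. This is a minimal sketch, not the VBA the answer has in mind: it assumes the data is a list of row lists with the comma-separated values at index 2 (column C), and `split_rows_in_place` is a hypothetical name.

```python
def split_rows_in_place(rows, col=2, sep=","):
    """Expand each row into one row per separator-delimited value in
    column `col`. Iterates bottom to top so that inserting rows does
    not shift the indices of rows still to be visited."""
    for i in range(len(rows) - 1, -1, -1):
        parts = [p.strip() for p in str(rows[i][col]).split(sep)]
        if len(parts) > 1:
            original = rows[i]
            # Replace the single row with one copy per split value.
            rows[i:i + 1] = [
                original[:col] + [part] + original[col + 1:] for part in parts
            ]
    return rows


# Hypothetical sample data: columns A, B, C, D, E.
data = [
    ["a1", "b1", "x, y", "d1", "e1"],
    ["a2", "b2", "z", "d2", "e2"],
]
split_rows_in_place(data)
for row in data:
    print(row)
```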

Excel - Comparing multiple columns to see if results are identical

I would like to compare three separate columns in an excel spreadsheet, across thousands of rows.
If any value appears in column A multiple times (say the word hello in column A rows 1 and 4, and the word bye in column A rows 3 & 5), I would like to check the corresponding values in column B for those rows (i.e. rows 1&4 and 3&5).
If the values in column B for rows 1&4 are, say, 15 & 15, and the values for rows 3&5 are 20 & 20, then I want to check column C.
Once we know rows 1&4 and 3&5 have the same corresponding values in columns A & B, I would like to check the corresponding values in column C. If these are different, then I would like to perform a specific calculation. If the values in column C are the same, then I want to ignore these rows.
I am sorry this is very unclear, as I cannot paste an image to show what I mean. I can email you an example if it helps.
This is way beyond me and my excel skills and I do not know where to start. Any help would be appreciated. I am hoping I don't need to write a Macro.
Thanks in advance!
So, to summarize your question as I understand it:
Column A holds string values (text). There are some duplicates here.
Column B holds number values. When a duplicate occurs in column A, the data in column B may or may not be identical to that of the other duplicate entries.
Column C holds values (you did not define what type of values, but I assume these are number values). Sometimes, duplicates in column A hold the same values in column B, and also the same values in column C. In this case, we can ignore the row as all the duplicates agree. Sometimes, duplicates in column A hold different values in column B. In this case, we can also ignore the values. Finally, sometimes duplicates in column A hold the same values in column B, but different values in column C. For these specific values, we want to perform some other type of calculation (which you did not specify).
Put the following in column D, starting at row 2 (assuming a header on row 1); this is the starting point of the formula we will build.
=IFERROR(VLOOKUP(A2,A$1:B1,2,0)=B2,"")
This says: look at column A, always starting at row 1 and going down to 1 row above the current row. Check for a match of the text in the current row. If it finds a match there, pull the result from column B. Does that result match column B in the current row? If it matches it will say TRUE; if it doesn't match it will say FALSE. If there are no duplicates yet in column A it will say "".
Now add a new check - if the above formula is TRUE [ie: there is a duplicate in column A, and the result in column B matches], then we want to check the results from column C:
=IFERROR(IF(VLOOKUP(A2,A$1:B1,2,0)=B2,VLOOKUP(A2,A$1:C1,3,0)=C2,""),"")
This will now return TRUE if the values in column C match for that duplicate in column A (which is only checked if the values in column B match too). Finally, add in your "special calculation", like so:
=IFERROR(IF(VLOOKUP(A2,A$1:B1,2,0)=B2,IF(VLOOKUP(A2,A$1:C1,3,0)=C2,"",C2+1),""),"")
Where I have C2+1, this is where you will perform your special calculation. This will only be recorded by Excel if: there is a duplicate in column A, that duplicate has a matching value in column B, and that duplicate has an unmatched value in column C.
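The decision logic of the final formula can be sketched in Python, which may make it easier to test on sample data. This is a rough sketch under stated assumptions: rows are (A, B, C) tuples, `flag_rows` is a hypothetical name, and `c + 1` is the same placeholder calculation as C2+1. Like an exact-match VLOOKUP over the rows above, each row is compared only against the first earlier row with the same A value.

```python
def flag_rows(rows):
    """For each (a, b, c) row, compare against the first earlier row
    with the same a. Apply the placeholder calculation c + 1 only when
    b agrees and c differs; otherwise return an empty string."""
    first_seen = {}
    out = []
    for a, b, c in rows:
        prev = first_seen.get(a)
        if prev is not None and prev[0] == b and prev[1] != c:
            out.append(c + 1)  # placeholder for the "special calculation"
        else:
            out.append("")
        # Remember only the first occurrence of each a value.
        first_seen.setdefault(a, (b, c))
    return out


# Hypothetical sample data.
rows = [
    ("hello", 15, 1),
    ("x", 9, 9),
    ("bye", 20, 5),
    ("hello", 15, 2),  # same A and B as the first row, different C
    ("bye", 20, 5),    # same A, B and C: ignored
    ("hello", 16, 7),  # same A, different B: ignored
]
flags = flag_rows(rows)
print(flags)
```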

Is there a way to delete cells if their value is contained in another column?

I have a spreadsheet with multiple columns with a few thousand rows and I would like to find the cells that are common across all columns. Is there a function that I can use to check if a cell value exists in a set of cells/column?
To find out if a value exists in all columns, in any row, you can put this formula in the next open column and drag down:
=AND(MATCH(A1,B:B,0),MATCH(A1,C:C,0))
This assumes you have data in columns A, B & C and the formula is in column D. Now you can sort on column D to group the values that appear in every column.
If a value is missing from one of the columns, MATCH returns #N/A and the whole formula shows an error. If that is the case try this:
=AND(IFERROR(MATCH(A1,B:B,0),FALSE),IFERROR(MATCH(A1,C:C,0),FALSE))
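For comparison, the same "is this value present in every other column" test can be sketched in Python. This is an illustrative sketch only, assuming each column is a plain list; `in_all_columns` and the sample values are hypothetical.

```python
def in_all_columns(col_a, *other_cols):
    """Return one boolean per value in col_a: True if the value occurs
    somewhere in every one of the other columns."""
    others = [set(col) for col in other_cols]
    return [all(v in s for s in others) for v in col_a]


# Hypothetical sample data standing in for columns A, B and C.
col_a = [1, 2, 3]
col_b = [3, 1, 5]
col_c = [1, 9, 3]
common = in_all_columns(col_a, col_b, col_c)
print(common)
```

A missing value simply yields False here, which corresponds to the IFERROR(...,FALSE) wrapper in the second formula.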
