Merge Excel sheets with different structure with Talend - excel

I have one excel file with this structure:
sheet1 - column1 column2 column3 column4
sheet2 - column1 column3 column4
what i want is to merge both sheets into one, by column name (i.e sheet2 will have null values on column2 but column1 and column3 should merge)
so I should have one sheet with column1 column2 column3 Only
what talend does is that he takes first 3 columns from both sheets and merge them
i.e col1 with col1, col2 with col3 and col3 with col4.
i wanna do this dynamically, i dont really have only 2 sheets, i have a lot, and a lot of excel files, example is just for explaning
Data looks like this
All sheets have commun columns, but some sheets have more columns some less

Related

Excel Dropdown - Data validation only list col1 entries where col2 value = x

I have a workbook with multiple sheets.
i have a dropdown on sheet 1 that retrieves a list of values from sheet 2 as a data validation by list.
on sheet 2, there are entries in col1 and col2... col1 is what appears in the sheet 1 dropdown.
in the dropdown, i only want the entries from sheet 2, col1 where col2 in the same row as col1 ="Y"
basically my dropdown is supposed to show the values marked as active on the second sheet
i'm digging around for how to do this... as i'm almost certain i've done exactly this before.
I know it is a "lazy" answer - but you can find a full (very detailed) walkthrough for your requirement afai understood it, here: https://www.contextures.com/exceldropdownfilteredlist.html

VLOOKUP search issue

Am having some issue using the VLOOKUP function in excel. Am a little new to excel and puzzled as to how to solve this problem. I have two excel sheets.
Sheet 1 looks like:
Col1 Col2 Col3
HIC GSN ND11
H1C 00214 0212107
C4I 07287 0214380
L1A 07731 0214501
Sheet 2 looks like:
Col1 Col2 Col3 Col4 Col5 Col6 Col7
Condition HIC1 HIC1 HIC1 GSN GSN GSN
AMA B60 B61 B62 02934 02935
ALD H1A H1C 04821 03473
HEC W0A W0B
The VLOOKUP formula that am using is:
=VLOOKUP(B2,'Conditions.xlsx]Conditions'!$B$2:$BH$24,60,0)
Here B2 refers to Sheet 1, Column 1. Index Number 60 is a repeated Column 1 in Sheet 2. Is there a better way to lookup the COl1 and Col2 in Sheet one to extract the Conditions Columnin Sheet 2?
Any help would be great.
Read this: How to use INDEX MATCH instead of VLOOKUP
But what happens if you want to look from right to left? VLOOKUP simply can't do that. INDEX/MATCH can.

Finding and obtaining the difference between two columns

I have two columns. Each column has tens of thousands of values. I need to find the difference between them and print the difference in some cells. I read similar questions, but they are not enough for my question, highlighting the different cells is not enogh for me because it would be very tiring to look at tens of thousands of cell by searching highlighted cells. Thus, i need to obtain the values.
Example:
Column1 Column2
John Jennifer
Mary Washington
Joe John
Michael Texas
Houston Newyork
Texas Mary
Values existing in col1 but not col2 : Joe, Michael, Houston
Values existing in col1 but not col2 : Jennifer, Washington, New York
Algorithmically, i need to check each row of column1 whether it exists in Column2, if the row does not exist in Column2, the value is taken.
Similarly, i need to check each row of column2 whether it exists in Column1, if the row does not exist in Column1, the value is taken.
Thanks
Old: If this is a once-off i'd make a backup then mark the area of the two columns and "remove duplicates" which is a button on the DATA command bar.
You would be left with unique records only, which is what I understand you want.
Edit: If you want to completely remove duplicate values then: Assuming that col1 is "A" and col2 is "B" then this formula only shows the record if it is in A but not in B. You can make a similar one for B not in A. In my example I made two columns C, D for the unique values. Then filter the list on these being "not nothing", and you have a set of unique records.
Current formula is for A2 cell - in my example placed in C2.
=IF(ISERROR(MATCH(A2;B:B;0));A2;"")

Counting duplicate rows using Excel

I need to count the number of duplicate occurrences of values in a column using excel, ONLY for rows with a certain value in second column i present.
Column1 Column2
value1 x
value1 x
value1 x
value1
value2 x
value2
value3 x
---> should give
VALUE Occurencies
value1 3
value2 1
valu3 1
How to do this?
Thanks!
You can use a pivot table, which does not require any typing of values or any formulas.
Drag column1 into the row area and column2 into the values. If it is text, it will be counted. If the other values in column 2 are blank you are done. If they contain values, you can drag column2 into the filters area and then use the filter dropdown above the pivot table to select what value in column2 to use.
After you add new data to columns A and B, select the pivot table and refresh it.
If Column1 is in A1, please try:
=COUNTIFS(A:A,D2,B:B,"<>")
copied down to suit where D2... contain Value1 etc

Query worksheet using column range

I'm trying to query an excel worksheet using ADODB. The problem is there are many columns with a similar name and I'm unable to select the correct column in my query. Is there a was to select the column by using its range? Something like
select [A:A],[AB:AB] from [Sheet1$]
The source worksheet kind of looks like this
A B C D E F G H I
1 08/19/2013 08/18/2013 08/17/2013
2 Col1 Col2 col3 col4 col5 col3 col4 col5 col3
3
When I try to import all the data I get all the data minus the column names, only col1 and col2 are fetched. Its the same when I do from [Sheet1$] and from [Sheet1$A2:K100]
If there are multiple columns with the same or similar names, Excel/ODBC will probably do some funky name-mangling to make sure they are different in the recordset. What I would do is get all the columns with "SELECT *" then examine the fields names in the result set - then you can go back and retrieve by name just the columns you want.

Resources