Excel - dynamically placing boolean based on multiple criteria - excel

First off let me say thanks and sorry for potential poor formatting. I tried to make it as straightforward as possible, but I am not sure how to insert a table into this box.
I have the following two columns side by side in Excel, in columns A and B.
1 INIT
2 INIT
3 INIT
4 INIT
4 UNWIND
3 INIT
5 INIT
1 UNWIND
In column C, I would like to place a 1 if the numbers in the first columns are the same AND those numbers have both an INIT and the UNWIND. Thus the final solution would look like this:
1 INIT 1
2 INIT 0
3 INIT 0
4 INIT 1
4 UNWIND 1
3 INIT 0
5 INIT 0
1 UNWIND 1
Note in the example above, even though 3 showed up in column A twice, a zero was placed in column C because the second "3" had "INIT" in column B, not "UNWIND".
I was thinking of using a combination of If statements with Match, etc., but I'm not sure how to dynamically complete the task, since the range over which you are searching will change (all rows of the column, excluding the row you are currently in). In this example, for row 1, you would simply search all rows below it. However, for row 2, you would want to search the row above it, and all those that follow, and so on.
Any help is greatly appreciated.
Thanks

It looks like you're trying to test to see if any number in column A has both an "INIT" and "UNWIND" value. The simplest solution here is COUNTIFS with an AND.
=AND(COUNTIF(A:A,A1)=2,COUNTIFS(A:A,A1,B:B,"INIT")=1,COUNTIFS(A:A,A1,B:B,"UNWIND")=1)
Of course, note that the formula I provided will count ALL of the columns, and requires exactly two matching rows with one INIT and one UNWIND. And it'll return a boolean true or false rather than a 1 or 0, but you should be able to cast a boolean to whatever value easy enough.
(And if possible, use Tables post Excel 2007. They make the formula FAR easier to read.)

Alternate formula:
=--AND(COUNTIFS(A:A,A1,B:B,{"INIT","UNWIND"}))

Related

If match found yes/ no, two different tables, two different values

I did not see something like this in other questions/ forums so hopefully it can be done.
Need to know if values from one table are in another, checking to see if there is a match.
Table 1 in Sheet 1 is used to record "Incoming" data of Part Number and Lot Number.
Table 2 in Sheet 2 is used to record when record is "Outgoing".
Column A is Part Number, B is Lot number in both Sheets. Part number can repeat, but Lot # will not. Trying to find a way to return a Yes/ No or 1,0 if part number and lot number in Sheet 1 exists in Sheet 2 in Column C of Sheet1. I have attached a Snippet example what I am trying to do. This will help me generate info on if an Incoming record has been completed and left (Outgoing). I do not believe vlookup will work and have tried some different permutations of match. Open for other options. Thanks!!
Edit: Lot # does have to ablity to repeat (not often) but with a different corresponding Part Number. Need to know if there is a match with both Lot# and Part Number as in the Incoming record.
Use:
=--(COUNTIFS(Sheet2!A:A,A2,Sheet2!B:B,B2)>0)
If there are matches it will return 1 if not 0

Find values through index, but safe from deletion

I've got a input sheet that contains several different variables among which names, location and job. I want to display these values on different sheet, in which I summarize the data by group. So the input is as follows
A B C
1 NAME CIYY WORK
2 Bart NY Plumber
3 Emily NY Firefighter
4 Daniel LA Firefighter
5 Cohen NY Firefighter
6 Sasha LA Plumber
7 Dirk LA Plumber
8 Molly LA Plumber
Now I want to find the names from plumbers in LA, meaning it needs to look like this: (no #NA and no empty cells in between)
A
1 NAME
2 Sasha
3 Dirk
4 Molly
Should be easy with Indexing, Matching etc. However, the input is changed by simply deleting rows and adding rows. How can I achieve this without the risk of #REF errors. When Dirk would be deleted for instance, the output should look as follows:
A
1 NAME
2 Sasha
4 Molly
The solution on this page comes pretty close but leaves me with a #NA when the conditions are not met.
Although the values will be unique, I have a feeling that this solution for extracting unique values will be helpfull, but have been unable to combine them.
You need a two column match using INDEX and AGGREGATE.
=INDEX(A:A, AGGREGATE(15, 6, ROW(A$1:INDEX(A:A, MATCH("zzz", A:A)))/((B$1:INDEX(B:B, MATCH("zzz", A:A))=G$2)*(C$1:INDEX(C:C, MATCH("zzz", A:A))=H$2)), ROW(1:1)))
Fill down for successive matches. Use an IFERROR wrapper if you do not want error codes once you have run out of matches.
Try something like this, giving the whole range, cells and columns:
=INDEX(1:1048576,MATCH("Bart",A:A,0),MATCH("NY",2:2,0))
Thus, you can delete from A:A and from 2:2, the ranges would stay untouched.
I found a way to solve this issue by using INDRECT. I do not like this function however and would still advise to use the solution proposed by Jeeped.
The solution requires three steps.
Add an extra column with the two strings combined: i.e. D2 = B2&C2, and store the row of the first occurence of LAPlumber with =MATCH($M$17;D:D;0)
Now, iteratively, find the row of the next occurence of LAPlumber with: =IFERROR(MATCH($M$17;INDIRECT("$D$"&SUM($N$17:$N17;1)&":$D$16";TRUE);0);""). This function uses the previous row-numbers to redefine the array with INDIRECT. Note, the results of these values are relative to the previous row.
For each row, find the corresponding name: =INDEX($A$1:$C$15;SUM($N$17:N18);O$16). The sum function is used to combine the relative row-numbers.
In the picture I added a screenshot of step 3. Cell H3 contains step 1. The other cells in column H show the row numbers from step 2.

Rank the top 5 entries in different criteria

I have a table that I want to find the top X people in each of the different groups.
Unique Names Number Group
a 30 1
b 4 2
c 19 3
d 40 2
e 1 1
f 9 2
g 15 3
I've ranked the top 5 people by number by using =index($A$2:$A$8,match(large($B$2:$B$8,1),$B$2:$B$8,0)). The 1 in the LARGE function I linked to a ranked range so that when I dragged down it changed up the number.
What I would like to do next is rank the top x number of people in each group. So top 3 in group 1.
I tried =index($A$2:$A$8,match("1"&large($B$2:$B$8,1),$C$2:$C$8&$B$2:$B$8,0)) but it didn't seem to work.
Thanks
EDIT: After looking at the answers below I have realised why they are not working for me. My actual data that I want to use the formula with have multiple entries of numbers. I have adjusted the example data to show this. The problem I have is that if there are duplicate numbers then it returns both of the names even if one is not in the group.
Unique Names Number Group
a 30 1
b 30 2
c 19 3
d 40 2
e 1 1
f 30 2
g 15 3
Proof of Concept
Use the following formula in the example above in cell F2 and copy down and to the right as needed.
=IFERROR(INDEX($A$2:$A$8,MATCH(AGGREGATE(14,6,($C$2:$C$8=F$1)*($B$2:$B$8),ROW($A2)-1),$B$2:$B$8,0)),"")
In the header row provide the group numbers. or come up with a formula to augment and reset the group number as you copy down based on your X number in your question.
Explanation:
The AGGREGATE function unlike the large function is an array function without the need to use CSE. As such we can add criteria to what we want to use. In this case only 1 criteria was used and that was the group number. in the formula it was the following part:
($C$2:$C$8=F$1)
If there were multiple criteria we would use either an + operator as an OR or we would use an * operator as an AND.
The 6 option in the aggregate function allows us to ignore errors. This is useful when trying to get the small. It is also useful for dealing with other information that may cause errors that do not need to be worried about.
As this is technically an array operation avoid using full column/row references as they can bog down your system.
The basics of what the over all formula is doing is building a list that match the group number you are interested in. After filtering your numbers, it then determines which is the largest, second largest etc by what row you have copied down to. It then determine what row the nth largest number occurs in through the match function, and finally it returns to the corresponding name to that row with the index function.
Building on all the other great answers.
Because you have the possibilities of duplicate values in each group we need to do this with two formulas.
First we need to get the numbers in order. I used the Aggregate, but this could be done with the array LARGE(IF()) also:
=IFERROR(AGGREGATE(14,6,$B$2:$B$8/($C$2:$C$8=E$1),ROW(1:1)),"")
Then using that number and order we can reference, we can use a modified version of #ForwardEd's formula, using COUNTIF() to ensure we get the correct name in return.
=IFERROR(INDEX($A$2:$A$8,AGGREGATE(15,6,(ROW($B$2:$B$8)-ROW($B$2)+1)/(($C$2:$C$8=F$1)*($B$2:$B$8=E3)),COUNTIF(E$2:E2,E3)+1)),"")
This will count the number in the results returned and then bring in the correct name.
You could also solve this with array formulas - to filter a group whose name is stored in E1, your code
=INDEX($A$2:$A$8,MATCH(LARGE($B$2:$B$8,1),$B$2:$B$8,0))
would then be adapted to
=INDEX($A$2:$A$8,MATCH(LARGE(IF($C$2:$C$8<>E1,-1,$B$2:$B$8),1),$B$2:$B$8,0))
Note: After entering an array formula, you have press CTRL+SHIFT+ENTER.
Thank you to everyone who offered help but for some reason none of your methods worked for me, which I am sure was to do with the quality of my data. I used an alternate method in the end which is slightly convoluted but seemed to work.
=IF($C2="1",RANK($B2,$B$2:$B$8,1)+ROW()/10000,-1)
Essentially using the rank function and adding a fraction to separate out duplicate values.

Excel - find the biggest gap between numbers in rows

I have an excel file with >12500 rows in one column.
It contains such random strings with 20 digits:
2,3,4,6,7,8,12,13,14,24,30,42,45,46,48,50,56,58,**59**,61
1,2,6,8,11,12,13,16,17,21,24,27,28,33,34,42,44,48,58,61
3,7,10,13,14,15,18,21,23,24,25,29,30,34,37,48,51,56,57,60
8,11,13,16,17,19,21,27,29,35,36,39,42,44,46,50,53,54,57,60
2,4,7,9,21,26,28,30,32,34,35,37,38,39,43,44,50,60,61,62
10,13,15,18,21,22,23,24,25,26,40,42,48,49,51,52,56,**59**,61,62
1,2,4,7,14,15,18,20,24,29,30,32,35,41,42,50,52,55,58,62
1,4,8,9,10,12,17,24,25,33,37,41,43,44,46,49,52,**59**,61,62
1,2,4,6,9,12,15,17,21,24,30,31,32,36,41,44,47,48,51,58
2,7,10,12,15,16,20,24,25,27,30,33,39,44,45,52,54,55,58,60
5,7,10,11,20,22,24,31,32,33,36,38,39,41,43,47,50,52,56,58
3,6,8,9,14,15,19,21,25,28,34,37,39,45,47,54,55,56,57,**59**
1,2,3,4,5,8,14,15,18,20,23,31,33,37,42,45,46,51,52,55
I need to know whats the biggest gap between rows where a number hasn't repeated. For example - I search for any number (e.g 59) and I need to know what's the largest gap between two rows where number 59 hasn't repeated.
In this example it's 4 row gap between 59's.
Hope that I make myself clear.
Seems like a fun problem which admits a simple but not quite obvious answer. First -- make sure that the data is in 20 columns (use the text to columns feature under the data tab). Using your example, I came up with a spreadsheet that looks like:
V1 holds the target number. The formulas are in columns U.
In U1 I entered:
=IF(ISNA(MATCH($V$1,A1:T1,0)),1,0)
This formula uses MATCH to test if the value in V1 lies in the range to the left of it. If it doesn't the match function returns #N/A. The function ISNA checks for this error value. IF it is present, the overall formula returns 1 (since there are now 1 consecutive row without the target number) otherwise it returns 0.
The formula in U2 is similar with a little twist:
=IF(ISNA(MATCH($V$1,A2:T2,0)),1+U1,0)
The same basic logic -- but rather than returning 1 if the target number isn't present it adds 1 to the number above. The formula is then copied down the rest of the range. It has the effect of keeping a running total of consecutive rows without the target value. This running total is reset to 0 whenever a row with the target value is encountered.
The final ingredient requires no comment. In U14 I just have
=MAX(U1:U13)
which is the number you are looking for (assuming that the maximum number of consecutive rows without the target number is what you are looking for, even if this occurs either at the top or bottom of the data. If you want the largest gap that is literally between two rows where the number occurs, the logic would need to be made more complex).

Excel VLOOKUP not finding correct row

I have the following table of two columns:
102-6956821-1091413 1
115-8766130-0234619 2
109-8688911-2954602 3
109-7731824-8641056 4
If I put in the following VLOOKUP:
=+VLOOKUP(B2,B$2:C$5,2)
I get the result of:
1
2
1
1
If I change it to =+VLOOKUP(B2,B$2:C$5,2, FALSE) I get the expected:
1
2
3
4
But why is this? There is an exact match available so why does it need to approximate? If it is, why is it generating the numbers it is? How is it reducing that text value to a number that it is approximating from? Thanks!
Transfer from comment for the sake of an answer:
If your search list (ColumnB) were sorted you would see the results you expect anyway (though in a different order). For speed, VLOOKUP is using a binary search method for which an ordered list is necessary if to achieve meaningful results. There are exact matches only in the first half of the unsorted list (hence 1 and 2 are correct but neither 1 nor 1 is).

Resources