Change an excel sheet to be formatted differently - excel

I'm not really sure of the proper terms here so I didn't know how to search for an answer.
I have a csv file with information presented in the following way:
Carrier
State
Aetna Health and Life Insurance Company (AHLIC)
IOWA
Aetna Health and Life Insurance Company (AHLIC)
ARIZONA
Aetna Health and Life Insurance Company (AHLIC)
UTAH
Aetna
CALIFORNIA
Aetna
TEXAS
Aetna
IDAHO
Humana
UTAH
Humana
FLORIDA
Humana
GEOLRGIA
I need the information to be presented like so:
Carrier
State
Aetna Health and Life Insurance Company (AHLIC)
IOWA, ARIZONA, UTAH
Aetna
CALIFORNIA, TEXAS, IDAHO
Humana
UTAH, FLORIDA, GEORGIA
My table has about 2000 rows so I'd prefer to not do it completely manually. What would the best way to accomplish this be?

If you have access to the new array formulas
To get your list of unique values in the first column
=UNIQUE(A2:A10)
And to join the values associated with each unique value
=TEXTJOIN(", ",TRUE,FILTER(B:B,A:A=D2,""))

Related

Excel Multi Dependant Dropdowns Based on repeating data

Am looking to create a multi layer dropdown solution for selecting addresses, where the dropdowns options change based on prior selections.
E.g - Selection options are Region, Country, City, Address Line 1
The ask is that if someone selects Region EMEA, the country list should show only countries in EMEA. They then select a country and the city lists only those pertaining to that country.
The issue I think may be how the source data is structured, but hoping there may be a solution without have to alter the source information. If that's not possible, then any solution that works is great.
Preferably this will not use VBA. It can be used initially to structure the data, but not for the dropdowns to operate in use as the file will uploaded to Excel Online.
The structure of the address data is as follows as an example:
REGION
COUNTRY
CITY
ADDRESS LINE 1
APAC
Australia
Adelaide
12 Fairview Road
APAC
New Zealand
Auckland
1313 Cambridge Drive
EMEA
UK
London
24 Sycamore Street
EMEA
UK
London
25 Madison Avenue
EMEA
Spain
Madrid
567 Mulberry Street
APAC
New Zealand
Auckland
2345 George Street
APAC
India
Dehli
322 Locust Street
APAC
India
Dehli
12 Beech Street
APAC
India
Punjab
55 Route 7
LATAM
Brazil
Sao Paulo
7 Arch Street

search for multiple words in a column and then return the position of that term

I am working with excel and need some inputs on how we can search for multiple words in a column and then return the position from where the match was found. For example the table whose words i want to check are:
Column A Column B
North Carolina
South Boston
West Coast
East Central
The table i want to check these phrases in is below:
Column C
North West Carolina
Western Coastal
Eastern Time for Central
Southern Boston
The final output should give me something like below:
Column A Column B Column D
North Carolina 1
South Boston 4
West Coast 2
East Central 3
Note that we are searching for words in the 2nd table irrespective of the order in which they are. For example even though the first row in 2nd table is North West Carolina, we get a match. The output basically gives us the position of the phrase where we could match our text.
Can this be done in excel somehow?This seems to me like a combination of match() and search() somehow but i haven't been able to crack it. Can it be done?
I tried the formula listed below but its not working:
VLOOKUP(and($A1&"*",$B1&"*"),'Table2'!$D$2:$D$5,1,FALSE)
But this doesn't work
Thanks
Try this (assuming data starts in row 2)
=MATCH(1,ISNUMBER(SEARCH(A2,C$2:C$5))*ISNUMBER(SEARCH(B2,C$2:C$5)),0)
entered as an array formula using CtrlShiftEnter

excel - find same names but with different addresses

Ok I've been racking my brain about this but can't come up with a clean way of getting the data I need.
I have a large list of customer information. I am interested in finding out where customers have bought different items, the layout of my cells is like:
Customer Name, Customer Address, Product
Joe Bloggs, 123 The Avenue, Apples
Joe Bloggs, 123 The Avenue, Apples
Jim Smith, 456 The Road, Oranges
Jim Smith, 456 The Road, Apples
So in this case, I'm not interested in Joe Bloggs as he bought Apples both times. I'm interested in Jim as he bought Oranges AND Apples. Is there a helper column I could use to help me filter this data out? So, counting if the customer name and address appears more than once and if the products are different? Thanks.
the best way to do this is creating a pivot tabel wich is based on the customer and the product
then sort on amount of products
Succes
Toontje

How to merge similar rows in access/excel

I have the following table
Address, City, Data1, Data2, Data3
123 North 5th Street, San Francisco, A, B, C
123 N 5th Street, San Francisco, [Blank], D, [Blank]
123 North 5th St, San Francisco, E, F, G
I want to merge the data based on 2 criteria: the first 4 digits of the address and the city.
So the merge row would look like:
123 North 5th Street, San Francisco, AE, BDF, CG
I have about 6000 records include the "duplicates". I have the table in both access and excel, any help would be appreciated.
Are you sure you want to carry out the matching on that criteia? Would you for instance want to match the following record with those three above
123 North 4th Street, San Francisco?
The ideal way to do this is to address standardise the data first and then deduplicate it. In NZ for instance we use a PAF file (Postal Address File) to standardise the addressing and accurately issue a DPID (delivery point identifier) to each record. You'd then be in a position to match the data to carry out your second step of merging records (which is still a tricky exercise). There would be plenty of vendors around to facilitate this for you in the US for a small fee, Axciom I believe is a global player in this space.
If you don't want to do that, another option is to use a third party tool to match those records with some fuzzy logic rather than coding it yourself, I've used an addin to Excel before by a company called DQGlobal to run over data and match records.

How should I design an data table that has six or more variables?

The situation is akin to the following:
Assume I am a store selling fruits, I would like to record the costs of each type of fruit. Assume the customer has specific tastes, and they can differentiate everything.
A fruit can be an orange, apple,
pear, or peach
It could be n days fresh from the vendor
The fruits come from different countries and have different taxes for export
Fruits are also valued by weight
Fruits could have handling requirements (Fridge,water sprigs)
Fruits could also be valued just because of different origins
One idea is to do what is done in some industries, assign a unique product code for each variation: e.g. Orange5dayfreshAustralia200gfridgeSydney
This however, would be a HUUUUUGE long list of values, and should any costs change, it would be hell to search for Apples from Turkey that are 200 grams have now 20cents more export tax. Adding new variables would also massively mess up the whole table and require much recoding.
The other way is to assume a standard orange, peach, apple and pear and different weight(2 dimensional table). On other tables, prepare a list of cost savings/increase for any given deviation. (Law of probability should show me I have 80% "standard" fruits, and 20% niche fruits from funky places having funky costs)
E.g. a 1 day fresh orange compared to a standard 4 day fresh orange is 50cents more, regardless of weight.
(Yes, this means the primary table would need to have at least one variable that dont affect the others)
Final constrain: The data entry is done by hired hands, needs to be simple to enter in Excel. The programing can be difficult, but not the general interface.
Any suggestions?
If the information needs to be encoded in the product code, you can use subcodes to build a complete product code.
Type of fruit
AP - Apple
OR - Orange
PR - Pear
PC - Peach
You can use Country Codes for the country, and other subcodes for handling and origin.
EXAMPLE:
AP06BR145HROR
is a Brazilian apple, six days old, weighing 145 grams, with HR handling and OR origins.
Obviously you would need a lookup table and foreign keys for each of the code types.
You seem to have twigged your problem - your data structure is wrong.
Your list is really a collection of attributes that apply to each "batch" of stock you purchase - except perhaps for "n days fresh" where the attribute is the difference between the attributes of date picked and date delivered.
So what you really want is a data table for each attribute type, and then a transcation data table where you add the attributes. The input is a form where the attributes are loopups.
You also need your pricing rules in a table, which you can use to calculate prices.
If your transaction table records both sales and purchases then you can filter on the attributes to report by any combination you like.
But the bit that interests me is how you work out how old the fruit is you are selling. You might know from the paperwork that you have purchased x oranges from Spain on one day and y oranges from Italy on another day, but in the box oranges are just oranges - so how do you tell which are which? Or are you planning to use a FIFO or LIFO scheme?

Resources