I have a set of data in excel, which varies in size, that I need to increment the final number by 1 each time I duplicate it. For example:
100065202-TR01
100065204-TR01
100018998-TR01
100065202-TR01
100065204-TR01
100018998-TR01
100065202-TR01
100065204-TR01
IB500-TR01
100005693-TR01
100065202-TR02
100065204-TR02
100018998-TR02
100065202-TR02
100065204-TR02
100018998-TR02
100065202-TR02
100065204-TR02
IB500-TR02
100005693-TR02
I need a way that I can increment TR01,TR02,TR03... up to TR20. The text before the -TR** will remain the same every time, I will simply be duplicating a (sometimes very large) block of data but want to change the final number each time.
If you select the data you need and then "pull" to the right by clicking on the bottom right corner, where the black square is, it is done automatically by Excel.
See result
Related
In order to better analyse pages over time for SEO optimisations and rankings, I've made an excel spreadsheet that pulls in the top 20 pages by impressions and compares it to the top 20 pages by clicks.
I download the report for pages from Google Search Console and copy it into a sheet called PageDataInput. In a sheet called Page it pulls the top 20 pages according to the key metric chosen, either impressions or clicks
The table for impressions works without issue because there aren't any duplicate impressions and there is a low likelihood that there will be. However, this isn't the same with clicks.
Instead of referencing the next page with the same value, it just references the same page with that click for as long as there are duplicate values. Meaning that if there are 5 pages with the same click value, it'll reference the first page 5 times and then move on to the next page.
For Impressions I've used:
=INDEX(PageDataInput!$B$2:$B$952,MATCH(1,INDEX((PageDataInput!$D$2:$D$952=LARGE(PageDataInput!$D$2:$D$952,ROWS(B$2:B2)))*(COUNTIFS(B$2:B2,PageDataInput!$D$2:$D$952)=0),),0))
For Clicks I used the same code just different column to check:
=INDEX(PageDataInput!$B$2:$B$952,MATCH(1,INDEX((PageDataInput!$C$2:$C$952=LARGE(PageDataInput!$C$2:$C$952,ROWS(I$2:I2)))*(COUNTIFS(I$2:I2,PageDataInput!$C$2:$C$952)=0),),0))
The columns are all set correctly.
To deal with duplicates with respect to the amount of clicks you can create a new column which adds an incremental amount to the number of clicks and reference this column instead.
The easiest way to do this is by adding a very small random number:
+RAND()/100000
This will not really change the results but the LARGE function will work fine as the numbers are now essentially distinct.
I have thousands of cells of data that are output from a model, with my results formatted as follows: cell ID is column header, each row is a timestep, and each cell's results over hundreds of timesteps is printed out in a spreadsheet. I want to analyze the data within certain percentiles. I've identified what the values are for the percentile thresholds of interest, but I'm not finding clear directions on how to ...
a) remove all values that are outside of the range I'm interested in, for the sheet I'm working in
or
b) pull the values within the range of interest out of the sheet and into a separate one for further analysis
The values are numbers with two decimal places.
I need to scrub the data, then analyze it, in a separate step. For example, even after removing the extreme max and min in a timeseries, I still want to see the entire timeseries but with the outliers removed or changed to a null value. How can I select or simply remove all the outliers from this data matrix, leaving the rest of the data in tact?
The best way to do it is using the PivotTable feature.
With the PivotTable you will be able to create filter parameters using ranges (the main data and the outliers as well).
Please, take a look on this if you don't know how to use a PivotTable:
Create a PivotTable to analyze worksheet data
Although "Robust" the easiest way to do this would to to filter your data. After that, filter out all of your good values. Once you only have the "bad" cells visible, then go to special and highlight only visible cells. From there, delete.
To do this efficiently / with keyboard shortcuts, it would be like so:
1) Select the data headers and type alt + h + s + f
2) Click the "value" header, and click proper numbers until you have all the data you do not want filtered out.
3) Highlight all of the remaining cells (These should be the data points you want deleted, and the row numbers in excel should be blue now)
4) Type alt + h + f + d + s + y to go to special and select visible cells only
5) Type alt + h + d + r to remove the rows (this will take a bit of time, be patient)
I hope it works!
***EDIT: Instead of manually sorting out the data, you can also rank based off of size, and directly cut the data like that. After thinking more about the answer, this method would be easier (I think) and much faster.
Let's say I need to replace any cell that has a value of "outgoing" with multiple cells such as (0), (1), (0), (0), (2), in Excel. Is there a way to actually make this happen? I am doing this for a research project. Every item in my data needs to be coded on five different scales. There are 30-or-so items make up for almost half of the data. It would be enormously helpful to be able to simply replace the high frequency items with the five values at once.
I am not sure I completely understand the result you are looking for but here goes:
How about using the Find and Replace functionality to replace all instances of "outgoing" with "(0),(1),(0),(0),(2)" and then use the Text to Columns functionality to split the single column with "(0),(1),(0),(0),(2)" in to five separate columns, thus each value would be in its own cell.
You would need to split based on a delimiter (probably ",") and you should do all your replacing before you start splitting. Obviously you should test on some sample data first - Find and Replace is not your friend if you are not certain about your data set.
We are setting up a new way of logging our lot number and I was trying to find a way of doing it without having to manually enter each number.
The number is formatted as 161201-1 "1612" is the date code, "01" is the sub lot, "-1" is the line.
I want to make it so I only have to enter the first instance then can use a function that will add a line each time.
However there are only 4 lines per sub lot so each time the line reaches "-4" the next sub lot should be higher than the previous.
For example 161201-4 would be followed by 161202-1 (the date code stays constant until I type in a new lot).
I can't figure out what I would do for this kind of function.
Put every element of lot number in separate columns and than put it back together with "CONCATENATE" function. I did a sample for you and left the formulas as text in the last row.
I am developing a crystal report and data is also coming fine but I am unable to format the fields in the same, just like a comment field must be multiline but when I select the "can Grow" in Format object section, the display is vague.
Can Grow with a value zero stands for the display will grow to multiple lines till it is displayed fully.
As the grow can happen multiple lines, you can adjust the width of the item so that it will not overlap with other item.
What should You do:
Set can grow property to zero
Resize the item (for which you set cangrow 0) so that it will not overlap the item of the next column.