Bin expression by count

Bin expression by count - spotfire

I need Spotfire to bin the count of a field. For example if I have a field named COST_CENTER, I need to show 3 bins as the X axis on a chart.
Bin 1: COST_CENTERs that shows up once in the data table go in here
Bin 2: COST_CENTERs that show up 2-3 times
Bin 3: COST_CENTERs that show up 4+ times
Any ideas?

On the image below you can see the data and the chart.
I created two calculated columns.
One is called Count([Job]) but its formula is: Count([JOB]) OVER ([JOB]).
This first column returns in each row the number of occurences of a particular job name within the dataset.
That would be your COST_CENTER.
Then I defined/inserted a binned column with the formula of: BinBySpecificLimits([Count([JOB]])],10,30,100)
This Binned column is shown on the chart. So basically it collects in first bin all the Jobs
that shows up between 10 and 30 times and all the jobs that shows between 30 and 100 times in the second bin.
Last third bin with jobs showing up more than 100 times is empty.

Related

Finding the minimum-maximum differennces between 2 value which are not stable in Excel

I want to prepare 2 excel cells shows the daily minimum and maximum of the difference between two continuously variable numbers live. For example 15,0139 - 15.096 = 0.0821, here 15,0139 and 15,096 are changable (live gold prices from different market) so difference can change.
There are 2 value on the top of the image which are starting 15 and theirs differences bottom of them.

In Excel: How to built a graph with time as X, names as Y with multiples series?

I am looking for a way to make a specific graph in Excel and I can't find a solution in Excel or on the web.
I have data about an online training with people completing parts of a course at a certain time:
FullName
Course
TIME
Name-A
Part 1
23/03/2022 10:38
Name-A
Part 2
23/03/2022 12:07
Name-A
Part 3
23/03/2022 16:55
Name-B
Part 1
11/03/2022 15:14
Name-B
Part 2
22/03/2022 12:08
Name-B
Part 3
28/03/2022 16:06
Name-B
Part 4
30/03/2022 14:55
Name-B
Part 5
18/04/2022 08:13
Name-C
Part 1
11/04/2022 15:25
Name-C
Part 2
20/04/2022 13:50
I would like to have a specific graph of this data:
On the vertical axis: one row for each user' name: Name-A, Name-B and Name-C.
On the horizontal axis: continuous time (say, in days) From the minimum time in the table (or less) to the maximum (or more)
Series of plots for the data: Each part of the course (from Part 1 to Part 5 here) would be a series of dots of a specific color, placed on the right row (for a learner's name) above the corresponding time on the horizontal axis.
Do you have any idea on how it could be achieved?
All the best, R.S.
Edit: The table does not appear as in the preview so i try to add a screenshot:
Screenshot of the table

So one way to visualise this as mentioned in the comments is to create a separate series for each person and show passing each part of the course as a vertical step:
It's based very loosely on this but I've set each day in the date range as the x-coordinates and used a lookup to transform the data in H2
=RIGHT(XLOOKUP($G2+TIME(23,59,59),FILTER($C$2:$C$11,$A$2:$A$11=H$1),FILTER($B$2:$B$11,$A$2:$A$11=H$1),0,-1))+(COLUMN()-COLUMN($G$1))*10
pulled down and across to give
Explanation
The data for the graph has dates spanning the times in the raw data for its x-coordinates (column G). I generated it manually but could have used Sequence in Excel 365.
There are three columns of y-values, H to J, generating a separate series for each person. The three lines are initially spaced out by 10 units based on the column number. In the formula above, the raw data is filtered by the person's name so the headers in columns H, I or J match the names in column A in the raw data. Xlookup is used with 'next smallest' match so where the date in column G is greater or equal to the date/time in column C it will return the corresponding course from column B. Because column C actually contains date/times, I have added almost 24 hours when matching the date in column G to make sure that a match is found if the day is the same, regardless of time. In a case like Name-A, where three courses are completed in the same day, this will automatically select the last one (Part 3). Then I take the right-hand character of the course name (which is a digit in the sample data) and add it to the relative column number multiplied by 10. If there is no match, Xlookup returns zero so you just get the initial value for each series (10, 20 or 30), otherwise the result will be an increase by one unit each time a course is passed. If you couldn't assume the last character of the course name was a digit, you would need a lookup to assign a number to each course name.
The data is then plotted on a scatter graph with points joined by straight lines. I had to adjust the x-axis manually to make the range correct and the labelling clearer.
This could be done without Excel 365, probably using Aggregate to get the highest row number with a condition on the name and date.
EDIT
I could have achieved the same result much more easily using Countifs to find how many courses had been passed by a certain person by a certain date:
=COUNTIFS($A$2:$A$11,H$1,$C$2:$C$11,"<="&$G2+TIME(23,59,59))+(COLUMN()-COLUMN($G$1))*10
This wouldn't have needed Excel 365. If you needed to give different courses different weightings, you could do this with a sumproduct and a lookup, also fairly straightforward.

How to calculate average of parent category in excel?

I have data in below format. It shows starting and end time of an activity and calculates duration accordingly. The activity is performed through out the day at different times.
I have added a pivot. I want to find out the average duration in a workday or a holiday(Day category). When I am trying to apply average in the current pivot, it is dividing the total duration by the number of sessions in a day.For example in week 1, an activity was done on 4 work days and the total duration for the activity in workdays was 04.19, I want to divide this number by 4 and find out the average time spent on each day but the pivot divides it by 11 which is the total number of sessions in the four days.
Link for data

Steps:
Add a helper column to identify how many unique pairs of Dates/Day Categories there are:
=IF(SUMPRODUCT(($A$2:$A2=A2)*($B$2:$B2=B2))>1,0,1)
You can add extra products to this formula to force extra fields to be unique to be counted as well.
SRC:Simple Pivot Table to Count Unique Values
Add a Calculated Field in the PivotTable that is:
SUM(Duration)/SUM([Helper Column Name]) and include it in the 'Values' section of the PivotTable. Due to the new column being added, you might have to re-create the PivotTable.
This should produce the average in the manner that you want.

Spotfire: calculating difference between two rows in same column based on attributes in different columns

I am new to Spotfire and need help in getting the right expression for a calculated column.
My Data contains different subjects grouped in column ID. For every ID, Bodyweight was measured on different days. Days are given in column Day and stated as 1,2,3...
The last day is denoted by Last and Bodyweight measurements given in another column. Another column is present which is called Baseline. The Body weight measured is considered as baseline if the column contains a Y for that row.
I need to insert a calculated column, which will contain the difference between Body measurement measured on Day denoted Last and Body measurement marked by Y in column Baseline.
This should be done for every new ID. I am not able to figure this out. Could someone advise me on how to go about it?
Here is an example attached
So, the calculated column for Rita will give -4 (body weight at Last=56 and BodyWeight at baseline=56, so 52-56 =-4)

the sample data you provided is a little weird, particularly the [Day] column. if it's within your control, I suggest to use actual dates rather than a number/string here.
barring that, I was able to get your desired results, but it required two calculated columns: the first one will consolidate the [Day] and [Baseline] columns into a single column, and the second one contains your desired info.
column 1, which I called Day (int):
CASE
WHEN [Day]="Last" THEN 1000000
WHEN [Baseline]="Y" THEN -1000000
WHEN [Day]!="Last" THEN Integer([Day])
END
I picked a random high and low max to establish a chronological order. this will put 1000000 in place of "Last" (if you have any programs that are longer than one million days, you'll need to increase this number). the same for the [Baseline] column, but that value will be -1000000, which is presumably the lowest value you will ever see in this column. both of these are assumptions and may not work for your implementation. finally, in all other cases, the day number will be used.
column 2, which I called Diff:
Last([Weight]) OVER (Intersect([Name],LastNode([Day (int)]))) -
First([Weight]) OVER (Intersect([Name],FirstNode([Day (int)])))
the first line uses what's called an OVER expression to retrieve the first value for [Weight], ordered by [Day (int)], per [Name]. the second line gives the reverse of that, and so the difference is calculated as -4 (or whatever is appropriate).

Pivot Chart - Using detail values for Value Set 1 and Grand Total Values for Value Set 2

I currently have a dataset that consists of 30,000 records. Each record contains the Action Performed, Week, Person who Completed the Work, QC Flag. So in a day we might have 8 people work 3 tasks for a total of 700 work items. We randomly sample 40 of those cases and they fail 3 of those 40 collectively.
I would like a pivot chart that has a stacked column showing the portion of the total work allotted to each task. <---- Easy and completed
Then on the secondary axis I want to show the fail rate... not as a percentage of the total but as a percentage of those qc.
So, I can get the pivot data to say:
Week [1]...QC Fail [3]...QC Pass [37]...QC Blank [660]
...at the Task level. But the output shows the fail rate per task and I do not want that. I want task delineated by volume and the fail rate of the total population.

After a lot of meddling and trial and error I figured out a solution. It's a dirty workaround but it gets the job done. I created one flag field per task as calculated fields in my data. In addition I created flag fields for all items that failed QC. Then my pivot used Week as Row and nothing but values as columns. My values were then count of each task type, sum of my fail flag field, count of QC items. I then displayed the tasks as a stacked column and the fails and total QC as a stacked 100% line and I changed the scale to only display up to 15% and I deleted the total QC line. I will now have to add a flag field for each new task added. Dirty... but after 7 hours I'm done.

Develop Reference

node.js excel linux python-3.x azure haskell apache-spark rust .htaccess string

Bin expression by count - spotfire

Related

Finding the minimum-maximum differennces between 2 value which are not stable in Excel

In Excel: How to built a graph with time as X, names as Y with multiples series?

How to calculate average of parent category in excel?

Spotfire: calculating difference between two rows in same column based on attributes in different columns

Pivot Chart - Using detail values for Value Set 1 and Grand Total Values for Value Set 2

Categories

Resources