With the following function:
=FREQUENCY(C2:C724,D2:D37)
The second parameter is the BIN
What I don't understand is why Excel would increment the BIN for the rest of your values. The BIN does not change! It stays the same bin. Yet when I paste the formula for all my values it does this:
=FREQUENCY(C2:C724,D2:D37)
=FREQUENCY(C2:C724,D3:D38)
=FREQUENCY(C2:C724,D4:D39)
The last column is what was generated (and this is correct but it does not make sense!!)
Etoh bin
15.9 20 0
14.6 19 0
14.1 18 0
13.9 17 0
13.3 16 0
13.3 15 0
13.2 14 1
12.6 13 2
12.1 12 3
11.8 11 6
11.5 10 4
11.2 9 4
11 8 8
10.5 7 10
10.3 6 26
10.3 5 27
10.2 4 40
10.1 3 89
9.8 2 151
9.7 1 205
9.5 0 102
9.1 -1 17
8.9 -2 7
8.3 -3 3
8.1 -4 2
8.1 -5 0
7.9 -6 3
7.6 -7 2
7.5 -8 2
7.5 -9 1
7.5 -10 1
7.4 -11 0
7.2 -12 0
7.1 -13 1
7.0 -14 0
7.0 -15 0
6.8
6.7
6.6
6.5
6.4
6.2
6.2
6.1
6.0
5.9
5.8
5.8
5.7
5.7
5.7
5.5
5.5
5.5
5.4
5.3
5.3
5.3
5.3
5.3
5.3
5.3
5.2
5.2
5.2
5.1
5.1
5.1
5.1
5.1
5.0
5.0
5.0
5.0
5.0
4.9
4.9
4.8
4.8
4.8
4.7
4.7
4.6
4.6
4.6
4.5
4.5
4.5
4.5
4.4
4.3
4.1
4.1
4.1
4.1
4.1
4.1
4.0
4.0
4.0
4.0
4.0
3.9
3.9
3.9
3.9
3.9
3.8
3.8
3.7
3.6
3.6
3.6
3.6
3.6
3.5
3.5
3.4
3.4
3.4
3.4
3.3
3.3
3.3
3.3
3.2
3.2
3.2
3.2
3.2
3.2
3.1
3.1
3.1
3.1
3.1
3.1
3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0
3.0
2.9
2.9
2.9
2.8
2.8
2.8
2.8
2.8
2.8
2.8
2.8
2.7
2.7
2.7
2.7
2.7
2.7
2.7
2.7
2.6
2.6
2.6
2.6
2.6
2.6
2.6
2.6
2.6
2.5
2.5
2.5
2.5
2.5
2.4
2.4
2.4
2.4
2.4
2.4
2.4
2.4
2.3
2.3
2.3
2.3
2.3
2.3
2.3
2.3
2.3
2.3
2.3
2.3
2.2
2.2
2.2
2.2
2.2
2.2
2.2
2.2
2.2
2.2
2.2
2.2
2.1
2.1
2.1
2.1
2.1
2.1
2.1
2.1
2.1
2.1
2.1
2.1
2.1
2.0
2.0
2.0
2.0
2.0
2.0
2.0
2.0
2.0
2.0
2.0
2.0
2.0
2.0
1.9
1.9
1.9
1.9
1.9
1.9
1.9
1.9
1.9
1.9
1.9
1.9
1.9
1.9
1.9
1.8
1.8
1.8
1.8
1.8
1.8
1.8
1.8
1.8
1.8
1.8
1.8
1.8
1.8
1.8
1.7
1.7
1.7
1.7
1.7
1.7
1.7
1.6
1.6
1.6
1.6
1.6
1.6
1.6
1.6
1.6
1.6
1.6
1.6
1.5
1.5
1.5
1.5
1.5
1.5
1.5
1.5
1.5
1.5
1.5
1.5
1.5
1.5
1.5
1.5
1.5
1.5
1.4
1.4
1.4
1.4
1.4
1.4
1.4
1.4
1.4
1.4
1.4
1.4
1.4
1.3
1.3
1.3
1.3
1.3
1.3
1.3
1.3
1.3
1.3
1.3
1.3
1.3
1.3
1.3
1.3
1.3
1.3
1.3
1.3
1.3
1.3
1.2
1.2
1.2
1.2
1.2
1.2
1.2
1.2
1.2
1.2
1.2
1.2
1.2
1.2
1.2
1.2
1.2
1.1
1.1
1.1
1.1
1.1
1.1
1.1
1.1
1.1
1.1
1.1
1.1
1.1
1.1
1.1
1.1
1.1
1.1
1.0
1.0
1.0
1.0
1.0
1.0
1.0
1.0
1.0
1.0
1.0
1.0
1.0
1.0
1.0
1.0
1.0
1.0
1.0
1.0
1.0
1.0
1.0
1.0
1.0
0.9
0.9
0.9
0.9
0.9
0.9
0.9
0.9
0.9
0.9
0.9
0.9
0.9
0.9
0.9
0.9
0.9
0.9
0.9
0.9
0.9
0.9
0.9
0.9
0.8
0.8
0.8
0.8
0.8
0.8
0.8
0.8
0.8
0.8
0.8
0.8
0.8
0.8
0.8
0.8
0.8
0.8
0.8
0.8
0.8
0.8
0.8
0.8
0.8
0.7
0.7
0.7
0.7
0.7
0.7
0.7
0.7
0.7
0.7
0.7
0.7
0.7
0.7
0.7
0.7
0.6
0.6
0.6
0.6
0.6
0.6
0.6
0.6
0.6
0.6
0.6
0.6
0.6
0.6
0.6
0.5
0.5
0.5
0.5
0.5
0.5
0.5
0.5
0.5
0.5
0.5
0.5
0.5
0.5
0.5
0.5
0.5
0.5
0.5
0.4
0.4
0.4
0.4
0.4
0.4
0.4
0.4
0.4
0.4
0.4
0.4
0.4
0.4
0.4
0.4
0.4
0.4
0.4
0.4
0.3
0.3
0.3
0.3
0.3
0.3
0.3
0.3
0.3
0.3
0.3
0.3
0.3
0.3
0.3
0.3
0.3
0.3
0.3
0.3
0.3
0.3
0.3
0.2
0.2
0.2
0.2
0.2
0.2
0.2
0.2
0.2
0.2
0.2
0.2
0.2
0.2
0.2
0.2
0.2
0.2
0.1
0.1
0.1
0.1
0.1
0.1
0.1
0.1
0.1
0.1
0.1
0.1
0.1
0.1
0.1
0.1
0.1
0.1
0.1
0.1
0.0
0.0
0.0
0.0
0.0
0.0
0.0
0.0
0.0
0.0
0.0
0.0
0.0
0.0
0.0
0.0
0.0
0.0
0.0
0.0
0.0
0.0
-0.1
-0.1
-0.1
-0.1
-0.1
-0.1
-0.1
-0.1
-0.1
-0.1
-0.1
-0.1
-0.1
-0.1
-0.2
-0.2
-0.2
-0.2
-0.2
-0.2
-0.2
-0.2
-0.2
-0.2
-0.2
-0.3
-0.3
-0.3
-0.3
-0.3
-0.3
-0.3
-0.3
-0.3
-0.3
-0.4
-0.4
-0.4
-0.4
-0.4
-0.4
-0.4
-0.4
-0.4
-0.4
-0.4
-0.5
-0.5
-0.5
-0.5
-0.5
-0.5
-0.6
-0.6
-0.6
-0.6
-0.6
-0.6
-0.7
-0.7
-0.7
-0.7
-0.7
-0.7
-0.7
-0.7
-0.8
-0.8
-0.8
-0.8
-0.8
-0.8
-0.8
-0.8
-0.9
-0.9
-0.9
-0.9
-0.9
-0.9
-1.0
-1.0
-1.0
-1.0
-1.0
-1.0
-1.1
-1.1
-1.2
-1.2
-1.2
-1.3
-1.3
-1.7
-1.8
-1.9
-1.9
-2.1
-2.2
-2.4
-2.4
-2.5
-2.5
-2.6
-3.0
-3.2
-3.7
-4.6
-4.6
-6.1
-6.3
-6.3
-7.0
-7.8
-8.1
-8.5
-9.0
-10.2
-13.2
If I do as both of you suggested with the $, I am getting the WRONG results:
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
0
It's a bit of keyboarding to make it easier, not mouse work, but do exactly this:
In cell E2, enter
=FREQUENCY(C2:C724,D2:D37). Hit
Enter.
You should now be on cell E3.
Press on the up arrow once on your
keyboard to return to E2 (and it will be 0).
Hold
down the Shift key and press on the down arrow until you've
reached cell E37. E2 through E37 will be highlighted.
Don't do anything else other than
hit the F2 key on your
keyboard now.
Now press and hold down
Ctrl + Shift.
Then, with those two held down, hit
Enter.
Voila! In every cell between E2
and E37 you should see this
{=FREQUENCY(C2:C724,D2:D37)} in the formula bar
(notice the {} brackets) and
the formula works. This is what makes an array formula.
It looks like what you've done is tried to copy and paste the function from the first cell to each of the subsequent cells.
Because it's an array function, what you need to do is:
Copy the text of the first cell (from the formula bar)
Select all the cells you are wanting the results to appear in
Paste the text you just copied into the formula bar
Press ctrl-shift-enter
This will put the same array function into all the cells
Unsure what you mean but it sounds like you need to use absolute addressing, prefix the column with $
You need to use absolute reference for the second argument only =FREQUENCY(C2:C$724,D2:D37)
Related
I need help on the below problem,
Problem:
I have below dataframe. In this dataframe, first row is a title, and first column is also a title column. I have three different rows of 5.0, 10.0, 20.0 in first column (Phi) and I have different values against different values.
df_combined
Phi 0.0 10.0 20.0 30.0 40.0 50.0 60.0 70.0
5.0 -6.7 5.6 -2.7 -1.0 4.4 -6.4 6.3 -4.2
10.0 -3.8 3.1 -1.5 -0.5 2.5 -3.6 3.6 -2.4
20.0 6.3 -5.3 2.6 0.9 -4.2 6.1 -6.0 4.0
5.0 -1.7 5.6 -6.7 -7.0 1.4 -0.4 3.3 -4.2
10.0 -3.8 3.1 -1.5 -4.5 2.5 -1.6 2.6 -4.4
20.0 6.3 -1.3 2.6 0.9 -4.2 6.1 -7.0 4.0
5.0 -0.7 5.6 -6.7 -7.0 1.4 -0.4 3.3 -4.2
10.0 -3.8 3.1 -6.5 -2.5 6.5 -8.6 4.6 -3.4
20.0 6.3 -1.3 2.6 3.9 -3.2 4.1 -5.0 9.0
Expected output:
I want my dataframe lookimg like this where values against 5.0 come together. And same for 10.0 and 20.0. I do not want to aggregate or count or addition of these values. I just want these columns come together next to each other.
Phi 0.0 10.0 20.0 30.0 40.0 50.0 60.0 70.0
-6.7 5.6 -2.7 -1.0 4.4 -6.4 6.3 -4.2
5.0 -1.7 5.6 -6.7 -7.0 1.4 -0.4 3.3 -4.2
-0.7 5.6 -6.7 -7.0 1.4 -0.4 3.3 -4.2
10.0 -3.8 3.1 -1.5 -4.5 2.5 -1.6 2.6 -4.4
-3.8 3.1 -6.5 -2.5 6.5 -8.6 4.6 -3.4
6.3 -5.3 2.6 0.9 -4.2 6.1 -6.0 4.0
20.0 6.3 -1.3 2.6 0.9 -4.2 6.1 -7.0 4.0
6.3 -1.3 2.6 3.9 -3.2 4.1 -5.0 9.0
I tried groupby function like below,
df_combined2 = df_combined.groupby(df_combined['Phi'])
But got below error,
<pandas.core.groupby.generic.DataFrameGroupBy object at 0x000001BE9EC2BDC0>
What should I do?
sort the values by PHI and then make the repeating values as empty
# sort the dataframe
df=df.sort_values('Phi')
#to make repetitive values under PHI as empty
df['Phi']=df['Phi'].mask(df['Phi'].eq(df['Phi'].shift(1)), "")
df
OR
# sort the dataframe
df=df.sort_values('Phi')
# makes PHI as empty when diff from previous value is 0
df['Phi']=df['Phi'].mask(df['Phi'].diff().eq(0), '')
df
Phi 0.0 10.0 20.0 30.0 40.0 50.0 60.0 70.0
0 5.0 -6.7 5.6 -2.7 -1.0 4.4 -6.4 6.3 -4.2
3 -1.7 5.6 -6.7 -7.0 1.4 -0.4 3.3 -4.2
6 -0.7 5.6 -6.7 -7.0 1.4 -0.4 3.3 -4.2
1 10.0 -3.8 3.1 -1.5 -0.5 2.5 -3.6 3.6 -2.4
4 -3.8 3.1 -1.5 -4.5 2.5 -1.6 2.6 -4.4
7 -3.8 3.1 -6.5 -2.5 6.5 -8.6 4.6 -3.4
2 20.0 6.3 -5.3 2.6 0.9 -4.2 6.1 -6.0 4.0
5 6.3 -1.3 2.6 0.9 -4.2 6.1 -7.0 4.0
8 6.3 -1.3 2.6 3.9 -3.2 4.1 -5.0 9.0
I am trying to parse some data from Basketball-Reference, but so far I'm unable to do so. Here is my code for getting the raw html data
import requests
from bs4 import BeautifulSoup
url='https://www.basketball-reference.com/teams/DAL/2021/lineups/'
response=requests.get(url=url)
soup=BeautifulSoup(response.content,'html.parser')
>>soup.find(attrs={'id':"all_lineups_5-man_"}).find('table')
That last line gives an error, when it shouldn't. My guess is that it is happening because of the <!-- highlighted in yellow in the picture below. So my question is, how should I approach this?
You could loop through the comments, grab the tables, and then use pandas.
For example:
import pandas as pd
import requests
from bs4 import BeautifulSoup, Comment
from tabulate import tabulate
url = 'https://www.basketball-reference.com/teams/DAL/2021/lineups/'
response = requests.get(url)
soup = BeautifulSoup(
response.content, 'html.parser'
).find_all(text=lambda text: isinstance(text, Comment))
tables = [c for c in soup if "<div" in c]
frames = [pd.read_html(table, flavor="bs4") for table in tables]
print(tabulate(pd.concat(frames[-1])))
pd.concat(frames[-1]).to_csv("table_4.csv", index=False)
Output:
-- --- ------------------------------- ------ ----- ---- ---- ------ ---- ---- ------ ------ ----- ---- ------ ---- ----- ---- ----- ---- ----- ---- ---- ---- ---- ----
0 1 L. Dončić | T. Hardaway 435:01 1.9 1.8 -0.5 0.023 0.3 2.1 -0.012 0.025 -2 -3 0.012 -2.1 -3.9 -0.5 -3.9 -1.5 -3.2 0.8 -0.4 1.2 -1.1 -0.9
1 2 W. Cauley-Stein | L. Dončić 288:50 8.3 5.2 0.4 0.057 0.7 -1.3 0.032 0.06 -2.9 -3.8 -0.003 -0.6 1.2 5.3 1.2 2.5 5.5 0.6 -1.6 3 0.5 -0.8
2 3 L. Dončić | J. Richardson 266:22 -1.5 0 4.2 -0.023 -0.5 6.1 -0.071 -0.029 -1.1 -1 -0.01 -0.7 -3 -3.7 -3 -2.2 -5 -1.4 2.7 0.8 -4.4 -1.1
3 4 D. Finney-Smith | J. Richardson 252:38 -3.3 -0.5 5.1 -0.034 -0.4 5.8 -0.06 -0.04 -1.8 -2 -0.012 0.2 -1 -3.4 -1 -1.5 -3.5 0.3 0.2 0 -3.9 -1.7
4 5 T. Burke | L. Dončić 251:51 -2.2 3.2 1.1 0.03 -1.4 1.6 -0.052 0.021 -7.2 -9.5 -0.008 -3.8 -7 -1.4 -7 -2.8 -6.1 -3.6 0.3 1 -2.5 1.6
5 6 J. Brunson | T. Hardaway 246:37 -4.4 -2.9 -3.9 -0.012 -3.2 -3.8 -0.05 -0.027 4.5 3.7 0.06 -3.8 -8.1 -2.5 -8.1 -3.2 -7.2 -0.6 -1.2 0 -1.2 -1.6
6 7 T. Burke | J. Johnson 242:07 -2.5 2.3 -0.6 0.029 0.2 0.2 0.004 0.03 -7.2 -8.5 -0.042 -3.6 -7.1 -2.1 -7.1 -2.8 -6.2 -2.1 0 1.9 0 2.8
7 8 L. Dončić | D. Finney-Smith 236:07 -3.5 1.6 6.7 -0.019 -0.2 5.3 -0.055 -0.026 -6.4 -6.4 -0.064 -0.4 -2 -2.7 -2 -1.6 -3.6 0.2 0.9 -0.4 -3.1 -0.3
8 9 T. Hardaway | J. Richardson 230:38 -8.2 -0.7 7.3 -0.048 -0.8 8.8 -0.103 -0.059 -6 -5.8 -0.069 0.2 -2.7 -7.2 -2.7 -3.6 -8.3 -1.1 1 0.7 -5.3 0.1
9 10 L. Dončić | J. Johnson 229:11 -7.7 2.1 -1 0.028 -1.9 -1.8 -0.035 0.018 -10 -9.7 -0.11 -4.7 -9.1 -3 -9.1 -3.7 -7.8 -2.8 -0.9 0.8 -0.3 2.1
10 11 D. Finney-Smith | T. Hardaway 225:53 -1 3.8 10.2 -0.011 0 4.5 -0.04 -0.02 -8.6 -8.8 -0.077 0.9 -0.6 -5.2 -0.6 -2.2 -5.1 2.3 0.7 -0.9 -5.1 -0.1
11 12 W. Cauley-Stein | T. Hardaway 215:03 12.3 6 -0.9 0.071 1.8 -4.1 0.085 0.082 -1.4 -3.8 0.056 -1.9 -1.2 4.4 -1.2 1.6 3.4 3.9 -1.3 2.7 -0.4 0.1
12 13 T. Hardaway | J. Johnson 214:53 -2.7 0.3 -4.9 0.029 -0.1 -6.5 0.06 0.033 -3.1 -3.6 -0.017 -4.9 -9.3 -2 -9.3 -3.5 -7.8 -2 -0.2 1.2 1 2.1
13 14 J. Brunson | J. Johnson 190:02 -11.5 -1.6 -3.6 0.001 -4.7 -8.1 -0.052 -0.024 -3.7 -2.3 -0.081 -4.7 -9.6 -3.4 -9.6 -4 -8.7 -2.3 -1.9 0.5 0.5 1.8
14 15 T. Hardaway | K. Porziņģis 188:30 -12.7 -6.6 -3.2 -0.055 -1.1 4.8 -0.074 -0.059 1.7 -2.9 0.192 -3.2 -7.3 -5 -7.3 -4.5 -9 -0.6 -3.2 0.1 0.8 -1.2
15 16 L. Dončić | K. Porziņģis 181:14 -8.8 -2.5 -2.1 -0.017 -2.6 3.3 -0.1 -0.03 -1.1 -6.7 0.19 -2.5 -4.9 -1.7 -4.9 -2.6 -5.4 0.2 -4.1 1.5 1.8 0
16 17 T. Hardaway | D. Powell 178:57 -3.2 -0.7 2.1 -0.019 2 3.1 0.025 -0.009 -3.9 -3 -0.071 -2.4 -6.7 -6.1 -6.7 -4.3 -9.7 -0.7 1.2 0.3 -1.5 -0.1
17 18 T. Burke | T. Hardaway 177:36 -2.2 -2.4 -3.7 -0.008 2 2.8 0.027 0.006 0.5 -0.9 0.047 -3.3 -6 -1.8 -6 -3 -6.2 -6.2 2 1.3 0.7 0.1
18 19 J. Brunson | L. Dončić 165:52 -3.6 -0.8 -7.2 0.029 -4.4 -5.7 -0.077 0.008 2.5 3.1 0 -6.2 -11.6 -0.7 -11.6 -3.5 -7.8 0.2 -1.4 1.8 1.1 -1.6
19 20 L. Dončić | D. Powell 162:30 -5 0.3 2.3 -0.009 1 3 -0.002 -0.005 -6.6 -4.3 -0.119 -4.3 -11.1 -7.3 -11.1 -5.8 -13.2 0.7 2.3 0.7 -4.3 0.2
20 nan Team Average 965:54 -2.1 0.4 0.6 0.002 -0.7 0.8 -0.025 -0.002 -2.2 -2.3 -0.021 -2.1 -4.7 -2.3 -4.7 -2.2 -4.9 -0.4 -0.6 0.5 -1.8 -0.3
-- --- ------------------------------- ------ ----- ---- ---- ------ ---- ---- ------ ------ ----- ---- ------ ---- ----- ---- ----- ---- ----- ---- ---- ---- ---- ----
Sample output for a .csv file:
I'm trying to work with a dataset that has None values:
My uploading code is the following:
import pandas as pd
import io
import requests
url = "https://archive.ics.uci.edu/ml/machine-learning-databases/statlog/heart/heart.dat"
s = requests.get(url).content
s = s.decode('utf-8')
s_rows = s.split('\n')
s_rows_cols = [each.split() for each in s_rows]
header_row = ['age','sex','chestpain','restBP','chol','sugar','ecg','maxhr','angina','dep','exercise','fluor','thal','diagnosis']
c = pd.DataFrame(s_rows_cols, columns = header_row)
and
the output from c is :
But it seems that there are some columns that has None values.
How do I replace this None values by zeros?
Thanks
I think it is not necessary, if use read_csv with sep=\s+ for whitespace separator and also parameter names for specify new columns names:
url = "https://archive.ics.uci.edu/ml/machine-learning-databases/statlog/heart/heart.dat"
cols = ['age','sex','chestpain','restBP','chol','sugar','ecg',
'maxhr','angina','dep','exercise','fluor','thal','diagnosis']
df = pd.read_csv(url, sep='\s+', names=cols)
print (df)
age sex chestpain restBP chol sugar ecg maxhr angina dep \
0 70.0 1.0 4.0 130.0 322.0 0.0 2.0 109.0 0.0 2.4
1 67.0 0.0 3.0 115.0 564.0 0.0 2.0 160.0 0.0 1.6
2 57.0 1.0 2.0 124.0 261.0 0.0 0.0 141.0 0.0 0.3
3 64.0 1.0 4.0 128.0 263.0 0.0 0.0 105.0 1.0 0.2
4 74.0 0.0 2.0 120.0 269.0 0.0 2.0 121.0 1.0 0.2
.. ... ... ... ... ... ... ... ... ... ...
265 52.0 1.0 3.0 172.0 199.0 1.0 0.0 162.0 0.0 0.5
266 44.0 1.0 2.0 120.0 263.0 0.0 0.0 173.0 0.0 0.0
267 56.0 0.0 2.0 140.0 294.0 0.0 2.0 153.0 0.0 1.3
268 57.0 1.0 4.0 140.0 192.0 0.0 0.0 148.0 0.0 0.4
269 67.0 1.0 4.0 160.0 286.0 0.0 2.0 108.0 1.0 1.5
exercise fluor thal diagnosis
0 2.0 3.0 3.0 2
1 2.0 0.0 7.0 1
2 1.0 0.0 7.0 2
3 2.0 1.0 7.0 1
4 1.0 1.0 3.0 1
.. ... ... ... ...
265 1.0 0.0 7.0 1
266 1.0 0.0 7.0 1
267 2.0 0.0 3.0 1
268 2.0 0.0 6.0 1
269 2.0 3.0 3.0 2
[270 rows x 14 columns]
Then in data are not Nones and no missing values:
print (df.isna().any(1).any())
False
EDIT:
If need replace missing values or Nones to scalar use fillna:
c = c.fillna(0)
The SVG file is an outline of the highland map in Scotland. The closing reads like an error and is in red in Brackets. I'm using an SVG of the UK to reference from...If anyone has experience understanding the SVG file and sees where I've gone wrong, your help would be much appreciated.
<svg class="img-fluid" baseprofile="tiny" fill="#008000" stroke="black" version="1.2" viewbox="0 0 1000 1870" width="100%" xmlns="http://www.w3.org/2000/svg">
<path d="M226.8 714.8l0 1.3 2.5 0-1.3 6.4-1.8 3.7-2.7 1.6-3.6 0.4-1.7-1-1.6-4.2-1.4-1-1.4-0.6-3-3.2-1.8-1 5.7-6.7 3-2.4 3.8-0.5 5 3.2 2.1 2.4-1.8 1.6z m-23.6-5.8l-3.1 0.5-3.1-0.1-2.5-0.6 1.5-1.3 3.3-2 1.8-0.4 1.1 1 2.3 0.6 1.2 0.9-2.5 1.4z m56.7-42.7l-3.2 1.3-2.5-1.1-2.2-2.3-0.6-2.5 2.3-1.4 3.3 0.7 2.5 2.5 0.4 2.8z m-5.5-36.7l-0.4 2.7-1.6 5.3-1.4 6.2 0.1 5.4 0.8 1.3 1.3 3 0.5 2.9-1.6 1.3-1.6 0.7-1.5 1.1-1.6 0.6-1.6-1.1 0.3-1.2 0.4-1.3-1-1.5-0.2-3 0.8-11.4 0.7-3 1.5-1.4 2.8-0.4 0-1.2-1.2-1.8-0.8-1.9 1.1-1.1 1-0.1 1.9 1.2 0-1.3-1.3-2.4 0.8 0.2 0.7 0.5 0.6 0.7 0.5 1z m1.4-6.5l0 2.8-1.5-0.8-0.6-1.8 0.2-2.4 1.1-2.3-0.4-0.5-0.2-0.8 0.6-1.1 0.7-0.8 0.7-0.1 0.6 0.9-1.2 6.9z m-33-30.3l0.9 1.9 0.8 1.1 2.3 1.9 0.9 1.2 2.3 4.1 0.4 0.8 4.3 3.7 1.8 2.4 1 1.7 0.4 1.5 0.9 12.5 0.9 5.3 0.2 2.5-0.3 2.7-0.8 4-0.2 2.4-0.5 1.4-2.6 2.5 0.3 0.4 1.6-1.1 1.7-0.6 1.5 0.5 0.8 1.8-0.6 1.7-1 1.5-0.6 1 0.5 3 1.4 1.8 1.5 1.3 0.6 1.3-0.7 1.6-3.3 2-1.3 1.2 8.3-1.1 2.1 0.6 0.5 1.5-1.2 1.6-2.4 1.2 0 1.2 2.6-0.3 2.1-1.1 2-0.2 2.1 2.2 0.9 0.7 1.3 0.1 2.4-0.2 1.5 0.7 0.9 1.6 0.6 1.7 0.6 0.9 1.4 0.2 0.9-0.8 0.6-1.2 0.5-0.7 4 0.2 2.7-2.8 2.7-1 2.8 0 1.6 2.3 7.2 0-1.6 4.4 0 1.1 0.4 2-0.3 1.7-0.8 1.4-1.9 2.7-3.5 3-1.5 0.7-3.8 0.4-0.6 1.2-0.2 3.9-0.9 2.5-2 1.6-2.1 1.2-0.9 0.8-0.8 1.4-5.6 8-1.5 1.6-1.8 1.2-5.1 1-1.4-0.6-0.9-2.3 0.3-3 2.6-3.9-0.3-1.5 0-1.3 0.9-0.8 0.7-2.5 0.6-0.8 1.8-3.4 0.8-1.2 1.1-1 9.3-4.8 0-1.2-2.7 1.5-2.7 0.8-8.2 0.2-0.5-0.5-0.2-2.3-0.2-1-2.2-3.9-1.3-1.5-1.1-0.7 0.4 1.9 0.4 1.4 1.1 2.8-1 2.6-0.6 1.2-1 1 0.7 1.4-1.2 1.4-0.8 2-0.9 1.7-1.4 0.8-1.2-1.4-0.6-3.1-0.4-3.6-0.1-2.7-0.8-0.9-0.5-0.3-1.4 1.2-0.7-1.1-0.7-0.7-0.9-0.5-0.9-0.1 0 1.1 1.4 1.4-1.7 1.2-13.6 3.6 0-1.4 1.1-0.9 2.9-3.9 0-1.1-1.8-0.1-1.7 0.5-1.6 0.8-1.5 1.2-1.8-2.8-1.5-3.3 1.1-0.5 0.9-0.7 0.7-0.9 0.6-1.5-4.4 0.5-1.6-0.5-4.5-6.1-1.1-3.1-1.2-1.7-0.3-1.3 4.4-3.2 1.9-0.8 2.2 0.3 1.7 1.6 1.7 2.1 2 1.8 2.5 0.6-1.2-2.3-6-5.1-1.3-1.5-0.6-0.6-1.1-0.2-1.1 0.2-2.1 0.9-1.2 0 1.4-1.1 0-1.3-0.7-0.2-1.9-1 1-0.6 0.2-0.7-0.8-0.4-0.7-1-0.2-1.4 0.5-2-1.4 0.4-1.9 2.4-1.3 0.8 0.5-1.5 0.1-1.7-0.4-1.6-0.9-1.4-0.8 2.4-1.1 0.7-1.1 0.5-0.9 1.4-0.1 2.1 1 3.2-0.2 2-1.5 1.4-8.9-3.1-2.2-1.5-1.8-2.1-0.8-2.6-0.9-4.6-3.9-3.1-1.1-4 5.9 0-1.3-1.5-0.9-1.9-0.1-2.2 0.7-2.3 1.3-1.9 1-0.5 0.9 0.6 5 7.5 0.6 1.7 0.9 1.1 1.8 1.4 1.2 0.3-0.5-2.3 0-1.1 1.2-1.2-1.6-2-1.4-2.3-0.2-0.7 0.2-0.8 0.9-0.3 0.2-0.7 0.8-0.8 1.7 0.2 2.8 1.3-0.9-3.2-2.5-2.9-2.9-1.9-2.4-0.7 1.3-1.7 1-2.7 0.2-2.6-1.1-1.5 0-1.4 2 0.1 1.3 1.3 1.1 1.6 2.1 1.5 0.4 1.4 0.2 1.7 0.2 1.1 0.9 1.2 0.7 0.5 4.5 2.5 1 0.9 1.1 1.7 1.4 1.5 1.2-0.1 1.4-0.7 1.6-0.2-2.2 5.4 1 0.3 2-2.7 1.2-3.5 1.2 0.7 7.3 9.7 0-1.2-0.4-0.6-0.9-1.8 0-2.5-1.3-2.2-3.3-4-1.6-3.7 0.3-1.9 1.3-1.7 1.2-3.7-1.1 0.2-1.1-0.2-0.9-0.5-0.7-0.7 0.4-2.5-0.5-1.4-0.8-1.1-0.5-1.2-0.1-2.1 0.2-0.6 0.7-0.2 3.8-2.5 0.6-0.7 0.9-2.1 0.5-2.2 0.7-1.3 1.3 0.6 1-0.6 0.3-0.6z m245.5 13.7l1.6 3.7 0.9 3.7 1.9 1.4 0 2-0.7 1.2 0.6 1.8-0.1 3 2.3 1.8 0.6 2.6 0.2 2.7-0.8 5.6 0.2 2.8-3.3 4.1-0.4 2.1 0.5 1 1.6-0.4 3.4-4.1 5.4-0.6 1.4-1.9 1.7-0.7 7.7-1.6 5.7 5.6 3.9 0.9 1.3 7.1-0.6 2-2.7 2.2-2.6 5.1-2.6 2.1-1.3 2.2 0.3 1.1 3.6 2.3-0.9 8.7 3.3 2.5-0.9 1.8 0.3 1.8-0.9 2.5-1.6 1.9-2.1-0.1-5.4 2.6-2.4 3.2-3 1.9-2.6 3.1-2.6 0.3-3.3 3.4-0.3 0.2-5.6 3.2-0.7 1.9 0.5 1.1-0.9 5.7 0.3 6.4-0.8 1.8-2.6 3-1.5 6-1.9-0.7-5.9 1.7-2-0.6-4-3.4-0.9 0.7-1 2.5-0.6 5.2-0.9 0.7-5.5-1.2-6.4 1.3-6.2-2.4-1.3 0.2-2.3 2.6 0.6 3.1-0.5 1-5 2.9-4.8 1.4-2.9 3.7-3.2-0.5-1.1 0.8-0.3 0.5-1.2 1.1-4.2 6.1-1 0-2.6-1.3-1.9-2.9-1.9-0.2-2.9 4.5-0.7 2.5-2.6 1.5 0.5 4.8-2 2-0.3 1.3 1.1 2.8-5.7 3-5.5 0.9-0.6 1.3 0 3.8-4.6-0.4-3.3-1.8-7 1.2-3-1.7-2.8-0.5-4.5 2.7-4.7 0.6-1.8 3-2.1 0.9-1.4 1.9-6.7-1.6-2.3-2.5-12.3-1.6-0.7 0 0.6-1.1-1.9-1.3 0-1.3 0.9-0.5 2.1-1.9 2.4 0.3 1.3-0.1 2.8-2.6 2.2 0 4.1 1.3 2-0.6 3.8-2.5 7.7-1.8 1.7-1.2-15.9 3.5-7.4-3.6-0.1-1 1-2.5 0.6-0.8 6.5-7.2 1.7-2.9 1.6-3.4-5.7 5.1-0.8 1.4-0.4 1.1-1 1-1.2 0.7-1 0.3-0.8 0.5-0.3 1.3-0.2 1.5-0.3 1.4-1.3 2.6-1.2 1.5-1.5 0.7-1.7 0.1-0.8 0.4-0.9 1.6-2 0.7-0.8 1.1-1.1 2.1-2.7 3.5-1.5 1-2 0.4-1 0.7-1.4 3.2-0.9 0.7-1.2 0.4-1.1 0.9-1 1.2-3.9 5.8-6.3 7.7-6.6 5.3-0.6 0.3-0.9-0.7-1.1-2.3-1-0.5-1.6-0.3-1.8-0.8-1.7-1.2-1.1-1.4 1.4-2.8 0.4-1.4 0.2-1.7-0.7 0-3.6 5-5.8-1.3-14.3-11.9-0.6-1.3 0.1-2.8 1-1.3 1.4-0.5 4.8-0.4 3.3 0.7 3.1 1.6 2.6 2.7 0.8-1.3-2.3-1.9-1-1.2 0-1.7 10.2-6.9 2.3-0.3 6 3.2 2.5 0.5 5.1-0.4 2.8-0.7 2-1.4-8.9 1.2-2.1-0.4-4.8-3.1-4-1-10.5 5.8-3.6 1.1-1 0-1.2-0.5-2.2-1.6-1.2-0.3-6.7 0.5-7.1-1.9-2.4 0.2-1.9 1.9-1.7 1.1-2.4-0.5-2.2-1.3-1.3-1.2-0.9-1.9-0.3-1-0.1-1.2 0.2-1.7 0.5-0.1 0.6 0.4 0.7-0.5 2.1-3.3 1.3-1 14.7-1.6 1.2-0.6 3.3-2.6 1.4-0.5 1.7 0.4 1 0.9 0.9 1.3 1.4 1.1 5.1 2.2 0.8 0.7-0.1-1.9-0.5-1.9-1-1.3-1.7-0.2 0-1.3 1.2-0.2 1.2 0.1 1.1 0.5 1 0.9 1.4-1.8 2.2-0.8 4.4 1.3-7.2-3.6-1.9-2.3 0-1.9 1.3-1.3 10-1.5 3.9-1.7 3.1-3.2-1.2-0.6-1.8 0.2-1.6 0.8-1.5 3-2-0.1-3.7-1 0.8-1.8 3.8-4.2-4.3-0.8-9.1 2-4.4-1.2 3.2-0.9 3.4-0.2 0-1.3-0.7-0.5-2-2 1.7-2.1 1.1-3.6 0.8-4.1 1.1-3.5 1.7-2.2 2.7-1.9 2.9-1.1 2.5 0.5 0.9 1.1 1.9 3.5 1.1 1.4 1.4 1.1 1.5 0.7 3.2 0.5 5.2-1.2 4.6-2.4 0-1.2-1.7 0.5-1.5 0.9-1.4 0.5-1.3-0.7-2.2 1.5-2-0.5-2.1-1.4-1.9-0.8-1.5-1 0.4-2.2 0.1-2.5-2.1-1.5-0.8 0.1-2 0.9-1.1 0.2-0.9-0.4-1.7-1.7-1-0.3-1-1.3 1.4-3 4-5.2 1.1-1.9 0.7-0.7 1.2-0.1 2.8 0.3 1-0.8 1.7-1 2.4 0.9 4.1 3.1 1.9 1.1 9.5 0.1 1.2-1.8 1.6-0.1 1.6 0.4 0.6 0.3 0.7 0 0-1.3-1.9-1.2-8.1 1.4-2.2 0.9-1.3 0.2-0.7-0.6-2.4-3.2-1.1-1-8.1-2.5-1.2-1.8 0.7-3 3.4-4.9 1.3-2.4-1.8-3 2.3-3.4 4-2.6 3.3-0.7 3.1 2.3 3 3.7 3.3 2.5 3.7-1.3 0-1.2-4-0.9-0.9-0.8-1.4-2.6-0.9-1.2-1-0.5-0.9-1.2 1.3-2.7 2.3-2.5 2.3-0.9 0-1.3-2.3-1-1.6 1.4-1.2 2.2-1.2 1.1-1.6 0.6-3.4 2.4-1.9 0.7-2-0.3-3.7-1.7-1.9-0.5-4.1 1.4-2-0.1-0.8-2.5 0.4-0.6 2.2-5.5 0.8-0.8 2.1-1.3 0.8-0.9 0.9-0.8 0.7 0.7 0.5 1.2 0.5 0.6 2.3-0.4 10.7-5.1 4.3-4.1 0.8-5-1.6-0.4-2.6 2.7-4.4 6.2-2.5 2-2.2-0.4-2.3-1.1-2.8-0.5 2.3-1.5 0.8-1.2-0.2-1.5-0.8-2.2-0.2-1.2-0.4 0.1-1.5 1.5-6.5 8.6-2.4 1.2-5 0.8-2.2-1.2-1.7-7 0.3 0.2 0.9-0.9 0.1-0.5-1.1-2-0.2-0.6 0-2.4 0.3-1.1 1-1.2-1-0.6-2.9-0.9-0.7-0.4-0.2-1.9-0.9-3.1-0.2-2.3 0.3-1.3 1.3-3 0.4-1.2 0.1-2.5 0-2.3 0.4-1.4 1.5 0-0.8-3.6 1.8-0.7 1.7 0.3 1.6 1.1 5.2 5.4 1.6 1.2 0.8-0.7 1.7 2.6 0.8 0.6 2.9 2.1 1.1 0.3-0.5-1-0.8-2.7 6.9-1.2 4 0.7 2.2-0.1 0.4-1.2-2.9-2.5-4.4-1-4.2 0.6-2.7 2.4-3.3-3.9 0.6-0.6 0.7-1.7-3.2-0.7-1.6-2.8-1.2-3.5-1.8-2.9-1.5-0.6-1.5-0.2-1.2-0.7-0.5-2.2 0.6-2.2 1.9-2.2-0.5-1.7 0-1.3 3.6-2.1 1.8 0 2.8 3.2 1.2-0.2 1-1.4 0.8-1.9-1.1-0.4-0.6-0.9-0.7-1.2-0.9-1.1-1.2-0.7-8-1.9-0.3-1.9 0.5-4.7-0.2-2.1 0.3-1.5-0.8-3.6-0.2-2.3 0.4-1.9 0.7-1.8 1-1.5 1.3-1.1 2.4-0.9 2.5-0.1 2.5 0.8 2.4 1.6-0.4 2.4 0.3 3.6 0.7 3.6 0.7 2.7 1 1.9 1.2 1.3 1.5 0.8 1.6-0.1-0.8-0.8-0.4-0.9-0.7-2.2 1.2-0.1 0.9 0.3 1.8 1.1-0.6-3.5 0.4-2.6 0.1-2.3-1.3-2.8-3.7-3.5-1.2-2 1-1.9-0.8-0.2-0.5-1 0.9-1.7 2.2-3 0.8-1.5 1.4 1.8 1 0.7 3.6 0-0.6 5 2.7 3.6 4 2.1 3 0.4-0.5-1.2 1.1-1.6 1.2-5.1 0.9-2 1.2 0.1 4.4 1.9 1.3 1.1 2.7 3.3 3.2 2 7 2.7-1.5-2.5-1-1.1-2.8-1.3-4.3-4.4-6.1-2.8-0.8-0.8-0.4-1.9 0.4-1.2 0.6-0.1 0.7 0.8 0.7 0.1 0.6-2.1 1.8 1.1 2.7 4 2.1 1.1 1.1 0 1.8-1 1-0.2 1.2 0.4 2.2 1.6 0.9 0.4 1.2 0.7 3.6 3.5 4 1.9 2.8 5 1.9 1.3-0.5-1-0.7-1.6-0.5-1.7-0.3-1.3-0.5-1.2-9.6-8.5-2.3-0.6 0-1.4 1.9 0.1 0.7-1.4-0.1-2.1-0.9-2.2-1.2-1.1-3.1-0.6-1.3-1.3-2.9-0.7-3.9-5.1-2.5-0.5 0.7-1.1-1.4-2.1-1.9-0.7-3.9 0.3 0.7-2 0.6-0.6 0-1.3-3.2-5.8 1.8-2.4 3.7 1 2.3 4.8 2.1-1.8 2 0.7 2.2 0.3 2.3-2.9-0.6 0 0.4-1.8 0-1.7-0.5-1.5-0.7-1.2 1.4 0 0-1.3-2-1.3 1.3-1 2.7-0.7 1.3-0.7 0-1.4-3.2 0.2-1.5-0.4-1.3-1 0.7-1.2-0.7-0.9-0.8-0.7-0.9-0.5-0.9-0.3 0.5-3 0.2-0.9-1.2 0-1-0.7-4.4-6.9 0.1-3 1.5-0.5 2.2 0.9 4.1 2.9 1.6 0.4 1.6-0.4 3.9-2.6 2-0.7 2.1 0.2 2 1.7 0.7-1.3 1.3-1.2 1.4-0.9 0.8-0.3 1.7 0.5 2.9 2.6 1.3 0.6 4.8-0.4 1.5 0.4 6.6 5.1-0.9-1.9-1.1-1.2-2.6-2 1.2-1.1 2.8-0.5 1.3-0.9-6.1-1.2-1.9 0-0.8 0.5-1.1 1.6-1 0.3-1-0.3-2-1.7-1-0.4-1.4-1-2.9-4.3-0.9-1-0.2-0.7-1.8-2.8-0.3-0.3-0.8-2.4-0.5-2.6 1.9 0-1-2 0.1-1.8 0.7-1.8 0.9-2 0.1-0.9-0.2-2.3 0.4-0.4 0.7 0.2 1.3 0.8 0.6 0.1 5.6 4 4-0.2-2.7-2.2-0.9-1.2 0.3-1.5-0.6-1.3-0.9 0.9-1 0.5-1 0.1-1.1-0.2 0-1.3 1.3 0 0-1.3-0.7-0.1-1.3-1.3 5.3-3.1 2.4 1.1 2.9 2-10.6-9.3 0-3.4 0.2-1.5 0.7-0.9 5.1-3.1 1.5-1.5 1.1-2.9-0.7 0 0.8-3.8 0.3-3.7 0.7-3.2 1.6-2 12.3 2.8 2.3 1.5 1.7 2.3 0 0.9 0.5 2.9 0.6 2.5 0.3-0.2-0.1 2.1-0.2 1.6-0.5 1.4-0.6 1.3 1.8-1.4 0.7-1.7 0.2-1.9 0-1.9 0.4-2.7 0.8-1.2 0.6-1.3-0.5-3.1 1.7 1.5 3 4 1.6 0.8 2.7 2.1 1.2 0.4 1 0.7 0.1 1.5-0.4 1.7-0.4 1.2-1.1 1.5-3.4 3.3-1.8 4.7-1.8 3.4-0.9 2.5 1.8-0.3 5.7-6.4 0.9-0.5 0.6-2.9 1.6-1.3 1.9-0.9 1.8-1.8 1.1-3.1 0.5-3.4 1-2.6 2.3-1 4.2 0.8 4 1.8 4.7 3.5 0 1.4-0.2 0.8 0.2 1.8-2.2 2.7-1.5 3.3-1.2 3.8-1.7 4 1.7-0.7 1.6-1.6 2.6-3.9 0.1-0.8-0.1-1.5 0.8-0.6 3.9-4.3 0.2-0.4 3.8-2.6 2.5-1.2 1-0.2 1.9 0.7 2.9 2.7 2.1 0.6-1.3-1.3 1-0.8 1-2.2 0.6-0.7 1.1-0.2 1.9 1 1.3 0.2 0.8-0.5 0.8-2.4 1-0.6 3.7 0 2 0.6 0.9 0.1 1-0.7 0.9-1.3 1.2-3.1 0.9-0.6 0.8-0.2 0.3 0.2 1.8 3.6 0.5 0.5 2.2-0.5 8.2 1.4 8.2-1.4 4.2-1.9 6.5-5.1 3.7-1.8 6-1.1 3.1 0 2.2 1.1-1.3 1.3 2.3 2.2 2.7-0.3 5.5-1.9 5.3 2.4 2.6 0 1.3-3.7-1.6-0.7-1.8-1.3-1.5-1.9-1-2.5 2.1-3 1.5-1 1.6 0.3 0.4 1 0.2 1.7 0.4 1.7 0.7 0.7 2.9 0 6.3-2 3.9-0.4 5 3.6 3.2 0.1 6.3-1.3 3.3 1.4-0.6 3.4-1.9 3.8-0.4 2.7-1.1 1.5-1.4 3.4-0.9 1.5-2.5 2.5-0.7 1.2-1.1 3.5 0 3 1 2.2 2 1.3 1.2-0.1 1.1-0.7 0.9-0.2 0.8 1 0.3 2.2-0.4 1.6-0.9 1-1 0.3-0.9 6.5-4.2 8.3-5 7.4-3.7 4.2-4 2.4-8.4 3-3.4 2.1-5.1 5.2-0.8 1.7-0.8 3-1.8 3.1-4 4.5-21.1 17.5-7.1 3.7-1.8 1.8-0.8 1.9-0.9 3.7-0.6 1.3-0.8 0.9-2.4 1.4-3.1 0.7-6.1 4.3-1 0.6-0.6 1.1-0.4 1.5 0 1.9-1.6-0.9-0.4-0.4 0.6-1.2-1.3-0.7-2.5-0.3-2.7-1.7-0.5 0.8 0.3 1.5 0.8 1.6 1.5 1.1 2.9 0.1 1.5 0.7 0.7 0.8 2 3.5-2 3.4-0.5 1.9 0.5 2-1.5 0-2.9-1.1-1.5 0-0.8 0.7-1.7 2.3-0.8 0.7-2.7-0.1-5.5-2.9-2.7-0.7-7.7 2.1-2.1-1.5-3.1-4.4-1.9-1.7-1.9-0.7 5.1 7.7 3 2.6 3.7 0.8 3.9-1.8 2-0.4 1.7 1.6 1.6 2.1 2.3 1.6 1.6 0-0.6-3.1 6.2 4.3 3.3 0.9 5.5-4.3 3 0.7 1.7 2.3-1.3 2.3 3.2 0.9 3.5-1.6 3.5-3.3 2.7-4 1.4-1.7 1.2 0.2 0.7 1.7-0.3 3-0.8 1.6-13 17.2-2.2 1.8-2.5 3.9-1.3 1.3-1.9 0.4-1.5-0.7-0.5-1.8 1.2-2.8-2.6-2.5-3.3 1.2-7.4 6.8-1.3 0.6-2.5 0.1-6.2 1.8-1.8-0.3-1.6 0.1-0.7 2-0.7 2.5-1.4 1.2-1.4 0.6-6.2 6.3-0.9 1.2-0.8 1.8-0.8 2.8 0.5 0.1 1.3-1 2.9-1.2 1.4-1.5 1.4-1.8 0.9-1.7 1.2-1.7 4.4-2.7 3.7-3.9 4.1-2.1 0.8-0.1 1.1 0.5 1.2 1.6 4.9 1.1 2.1-0.1 1.5-1.2 1.2-1.3 4.2-2.7 1.5-0.3 0.4 2.3-2 3.7-2.7 3.5-2.8 2.3-3.4 5.4 0.6 1.6 0.7 0.9-4.5 0.4-1.8 1.1-1.6 2.1 0.9 1.7-0.7 0.2-2.2-0.7-5.2 0 0 1.4 2.7-0.1 1.3 0.3 1.2 0.9-1.8 4.1-1 1.8-1.1 1.4 3.4 0.9 1.8 0.1 2-1 1.5-2 0.8-1.1 0-0.6 2.2-0.1 1-0.6 1.8-2.6 1.4-1.4 1.5-1 1.4-0.4 1.7-0.8-1.1-2-3.6-3.3 2.7 0.2 8.8-1.4 9.2 0.9 3-0.9 7.9-6z" id="GBR2745" name="Highland">
</path>
</svg>
In R programming language I can do following:
x <- c(1, 8, 3, 5, 6)
y <- rep("Down",5)
y[x>5] <- "Up"
This would result in a y vector being ("Down", "Up", "Down", "Down", "Up")
Now my x sequence is an output of the predict function on a linear model fit. The predict function in R returns a sequence while the predict function in Spark returns a DataFrame containing the columns of the test-dataset + the columns label and prediction.
By running
y[x$prediction > .5]
I get the error:
Error in y[x$prediction > 0.5] : invalid subscript type 'S4'
How would I solve this problem?
On selecting rows:
Your approach will not work, since y, as a product of Spark predict, is a Spark (and not R) dataframe; you should use the filter function of SparkR. Here is a reproducible example using the iris dataset:
library(SparkR)
sparkR.version()
# "2.2.1"
df <- as.DataFrame(iris)
df
# SparkDataFrame[Sepal_Length:double, Sepal_Width:double, Petal_Length:double, Petal_Width:double, Species:string]
nrow(df)
# 150
# Let's keep only the records with Petal_Width > 0.2:
df2 <- filter(df, df$Petal_Width > 0.2)
nrow(df2)
# 116
Check also the example in the docs.
On replacing row values:
The standard practice for replacing row values in Spark dataframes is first to create a new column with the required condition, and then possibly dropping the old column; here is an example, where we replace values of Petal_Width greater than 0.2 with 0's in the df we have defined above:
newDF <- withColumn(df, "new_PetalWidth", ifelse(df$Petal_Width > 0.2, 0, df$Petal_Width))
head(newDF)
# result:
Sepal_Length Sepal_Width Petal_Length Petal_Width Species new_PetalWidth
1 5.1 3.5 1.4 0.2 setosa 0.2
2 4.9 3.0 1.4 0.2 setosa 0.2
3 4.7 3.2 1.3 0.2 setosa 0.2
4 4.6 3.1 1.5 0.2 setosa 0.2
5 5.0 3.6 1.4 0.2 setosa 0.2
6 5.4 3.9 1.7 0.4 setosa 0.0 # <- value changed
# drop the old column:
newDF <- drop(newDF, "Petal_Width")
head(newDF)
# result:
Sepal_Length Sepal_Width Petal_Length Species new_PetalWidth
1 5.1 3.5 1.4 setosa 0.2
2 4.9 3.0 1.4 setosa 0.2
3 4.7 3.2 1.3 setosa 0.2
4 4.6 3.1 1.5 setosa 0.2
5 5.0 3.6 1.4 setosa 0.2
6 5.4 3.9 1.7 setosa 0.0
The method also works along different columns; here is an example of a new column taking values 0 or Petal_Width, depending on a condition for Petal_Length:
newDF2 <- withColumn(df, "something_here", ifelse(df$Petal_Length > 1.4, 0, df$Petal_Width))
head(newDF2)
# result:
Sepal_Length Sepal_Width Petal_Length Petal_Width Species something_here
1 5.1 3.5 1.4 0.2 setosa 0.2
2 4.9 3.0 1.4 0.2 setosa 0.2
3 4.7 3.2 1.3 0.2 setosa 0.2
4 4.6 3.1 1.5 0.2 setosa 0.0
5 5.0 3.6 1.4 0.2 setosa 0.2
6 5.4 3.9 1.7 0.4 setosa 0.0