I am listing the files/directories that are older than N days using the commands below:
DATE=`date +%Y-%m-%d`
dt=`date --date "$dt" +%Y%m%d`
loop_dt=`date -I --date "$dt -1 day"`
*** output of loop_dt = 2018-02-25***
hdfs dfs -ls r /path/ | awk '$6 < "$loop_dt"'
I know the above hdfs command is wrong, but I want to pass the loop_dt variable into the awk command to list the files that are older than N days.
Note: if I hardcode the date in the awk command, I get the expected results.
To recap what I have said in the comments you need to fix your awk command in the following way:
$ cat file
$ awk -v var="2015-08-12" '{if( $1 < var"") print}' file
Replace 2015-08-12 by your shell variable "$loop_dt" and it should work.
Use the syntax awk -v awkVarName="$shellVariable" 'BEGIN {print awkVarName}' to pass a shell variable to awk.
In if( $1 < var"") the trailing "" concatenates an empty string onto var, which forces a string comparison.
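As a small self-contained sketch of the comparison described above, the following feeds made-up rows through printf in place of the hdfs listing. The awk variable name cutoff and the sample rows are assumptions for illustration only. Dates in YYYY-MM-DD form sort correctly as plain strings, which is why the string comparison is enough here.

```shell
# Value reported in the question for loop_dt; normally computed with date.
loop_dt="2018-02-25"

# Made-up rows whose first field is an ISO date (stand-in for the real listing).
older=$(printf '%s\n' '2018-02-20 a.txt' '2018-02-25 b.txt' '2018-03-01 c.txt' |
  awk -v cutoff="$loop_dt" '$1 < (cutoff "")')

# Only the row dated strictly before the cutoff is kept.
echo "$older"
```

For the original listing, the same pattern would be applied to the date column of the hdfs output ($6 in the question) instead of $1.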
How to print nth column in a file using awk?
To print the second column of a file, I try:
awk '{print $2}' file
But if n is a variable, how do I print the nth column of the file using awk?
awk -v var="$n" '{print $var}' file
Give this a try; notice the -v option:
awk -v x="${n}" '{print $x}' file
From the man page:
The option -v followed by var=value is an assignment to be done before
prog is executed; any number of -v options may be present.
For more examples, you could check Using Shell Variables in Programs
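A quick way to check this behaviour is a sketch like the one below. The sample words and the awk variable name col are made up for the example; inside awk, $col means "the field whose index is the value of col".

```shell
n=3   # hypothetical column number chosen for the example

# -v copies the shell value into the awk variable col before the program runs.
third=$(echo 'alpha beta gamma delta' | awk -v col="$n" '{print $col}')

echo "$third"
```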
I am a beginner at Linux and I'm trying to do a project which takes every line from a file.txt and replaces the third word with the first of each line. Here is my Shell code but it doesn't seem to work. It keeps replacing the third word with $field1 and not what's in it.
while IFS=: read -r field1;do
sed -e 's/[^:]*[^:]/$field1/3'
done < file.txt
Try this. It prints each modified line to standard output; file.txt itself is not changed:
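while read -r line
do
# first and third colon-separated fields of the current line
first=`echo "$line" | awk -F':' '{ print $1 }'`
last=`echo "$line" | awk -F':' '{ print $3 }'`
# double quotes let the shell expand $last and $first inside the sed script
echo "$line" | sed "s/$last/$first/"
done < file.txt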
Input file :
Output :
Make note of the single quotation marks. Inside single quotes the shell does not expand $field1, so close the quotes before the variable and reopen them after it, like so:
sed -e 's/[^:]*[^:]/'"$field1"'/3'
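As an alternative sketch of the same idea, the whole sed script can be put in double quotes so that the shell expands the variable before sed runs. The sample line below is made up, and the sketch assumes the value contains no /, & or backslash, since sed treats those specially in the replacement text.

```shell
# Made-up sample line; the real file.txt is not shown in the question.
line='one:two:three:four'

# First colon-separated field, taken with shell parameter expansion.
field1=${line%%:*}

# Double quotes let the shell substitute $field1 before sed sees the script.
# The trailing 3 replaces only the third run of non-colon characters.
result=$(printf '%s\n' "$line" | sed "s/[^:][^:]*/$field1/3")

echo "$result"
```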
My current shell is bash, but
echo '1 2 3 4'| awk '{print $2}'
bash -c " echo '1 2 3 4'| awk '{print $2}' "
give different results. The OS is Linux. What's wrong with the second statement?
How about using a HERE-document?
bash <<'EOH'
echo '1 2 3 4'| awk '{print $2}'
EOH
Note that I have dropped the -c option to bash, and that the single quotes around EOH are necessary to avoid evaluating $2 on the shell level.
In your second command, $2 is evaluated before being handed to awk, which gives the following command :
echo "1 2 3 4" | awk "{print }"
To avoid this, you can use this awful syntax :
bash -c 'echo "1 2 3 4"| awk '"'"'{print $2}'"'"''
Or this syntax suggested by chepner :
bash -c $'echo "1 2 3 4" | awk \'{print $2}\''
The problem is that variable expansion is done twice here : a first time when the bash -c command is evaluated, and a second time when the spawned bash process evaluates its command line.
My initial answer was to change the command to bash -c 'echo "1 2 3 4"| awk "{ print $2 }"', which indeed avoided expansion in your current shell. However, in the spawned bash process, expansion was performed on the following command:
echo "1 2 3 4" | awk "{print $2 }"
And $2 was expanded to the empty string.
So we need this command to be executed by the spawned bash :
echo "1 2 3 4" | awk '{print $2 }'
And we need to surround it with single quotes in the current shell :
bash -c 'echo "1 2 3 4" | awk '{print $2 }''
Except that here the awk quotes close the quotes of the bash -c parameter, which leads to the command above, where we use '"'"' to write a single quote inside single-quoted text.
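To check the quoting described above, here is a small sketch that runs the '"'"' form next to one further variant that is not from the answers: keeping the outer double quotes and escaping the dollar sign as \$2, so that the current shell leaves it alone. In both cases the spawned bash should receive awk '{print $2}' unchanged.

```shell
# Form from the answer: '"'"' closes the single quote, adds a quoted ', and reopens it.
spliced=$(bash -c 'echo "1 2 3 4" | awk '"'"'{print $2}'"'"'')

# Assumed variant: inside double quotes, \$ stays a literal $ for the spawned bash.
escaped=$(bash -c "echo '1 2 3 4' | awk '{print \$2}'")

printf 'spliced=%s escaped=%s\n' "$spliced" "$escaped"
```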
I want to add a word, which is stored in a variable, at the end of each line in my text file. Whenever I execute the shell script, the variable name itself gets concatenated instead of the content stored in the variable. Below is an example:
cat output2.txt
awk '{print $0"^$att1"}' output2.txt >output3.txt
Desired Output:
Try this:
awk -v att1='Ramesh^Mumbai' -v OFS='^' '{print $0,att1}'
The -v option passes a variable to awk.
OFS is the output field separator (it replaces the , in the print statement with ^).
man awk:
-v var=val
Assign the value val to the variable var, before execution of
the program begins. Such variable values are available to the
BEGIN block of an AWK program.
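The -v assignment described in the man page excerpt can be tried with a sketch like this one. The two input rows are made up, since the contents of output2.txt are not shown; the value of att1 is the sample value used in the answer.

```shell
att1='Ramesh^Mumbai'   # sample value; in a real script this is set elsewhere

# att1 is an awk variable here, so it appears outside the awk string "^".
appended=$(printf '%s\n' 'row1' 'row2' |
  awk -v att1="$att1" '{ print $0 "^" att1 }')

echo "$appended"
```

Quoting "$att1" on the command line keeps a value that contains spaces as a single -v argument.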
You can use this:
awk -v att1="$att1" '{print $0"^"att1}' output.txt > output3.txt
user#host:/tmp$ cat output.txt
user#host:/tmp$, ./
user#host:/tmp$ cat output3.txt
I am trying to write a shell script to get certain data from the logs.
Below is a sample log:
2014-07-08 16:08:25,684: |ABC_130|1|10123ffffff2|P|489440201
2014-07-08 17:08:25,684: |ABC_130|1|aaaaaxxxxaab|P|489440201
2014-07-08 19:08:25,684: |ABC_130|1|aaaaababbaab|P|489440201
Below is the part of the script where I am facing an issue: the awk command doesn't give any output.
DATE_HOUR="`date -d '1 hour ago' "+%Y-%m-%d %H"`"
awk -F ":" '{if ($1='"$DATE_HOUR"') print $0}' log.txt
Don't use a shell variable like that in awk. Use -v name=val:
awk -F ":" -v dt="$DATE_HOUR" '$1==dt' log.txt
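By the way, I reduced your awk command to '$1==dt', since print $0 is the default action and the if condition can be moved out of the curly braces as a pattern. Note also that the comparison uses ==; a single = is an assignment in awk.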
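Here is a reproducible sketch of that command run against the three sample log lines from the question. DATE_HOUR is fixed to a literal value so the result does not depend on the current time; in the real script it comes from date -d '1 hour ago'.

```shell
# Fixed value for a repeatable example (assumption; normally computed with date).
DATE_HOUR='2014-07-08 17'

# With ":" as the separator, $1 is the date plus the hour, e.g. "2014-07-08 17".
matched=$(printf '%s\n' \
  '2014-07-08 16:08:25,684: |ABC_130|1|10123ffffff2|P|489440201' \
  '2014-07-08 17:08:25,684: |ABC_130|1|aaaaaxxxxaab|P|489440201' \
  '2014-07-08 19:08:25,684: |ABC_130|1|aaaaababbaab|P|489440201' |
  awk -F ':' -v dt="$DATE_HOUR" '$1 == dt')

echo "$matched"
```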