Logstash to convert epoch timestamp - logstash

I'm trying to parse some epoch timestamps to be something more readable.
I looked around for how to parse them into a normal time, and from what I understand all I should have to do is something like this:
mutate
{
remove_field => [ "..."]
}
grok
{
match => { 'message' => '%{NUMBER:time}%{SPACE}%{NUMBER:time2}...' }
}
date
{
match => [ "time","UNIX" ]
}
An example of a message is: 1410811884.84 1406931111.00 ....
The first two values should be UNIX time values.
My grok works, because all of the fields show in Kibana with the expected values, and all the values fields I've removed aren't there so the mutate works too. The date section seems to do nothing.
From what I understand the match => [ "time","UNIX" ] should do what I want (Change the value of time to be a proper date format, and have it show on kibana as a field.) . So apparently I'm not understanding it.

The date{} filter replaces the value of #timestamp with the data provided, so you should see #timestamp with the same value as the [time] field. This is typically useful since there's some delay in the propagation, processing, and storing of the logs, so using the event's own time is preferred.
Since you have more than one date field, you'll want to use the 'target' parameter of the date filter to specify the destination of the parsed date, e.g.:
date {
match => [ "time","UNIX" ]
target => "myTime"
}
This would convert the string field named [time] into a date field named [myTime]. Kibana knows how to display date fields, and you can customize that in the kibana settings.
Since you probably don't need both a string a date version of the same data, you can remove the string version as part of the conversion:
date {
match => [ "time","UNIX" ]
target => "myTime"
remove_field => [ "time" ]
}

Consider also trying with UNIX_MS for milliseconds.
date {
timezone => "UTC"
match => ["timestamp", "UNIX_MS"]
target => "#timestamp"
}

Related

LogStash dissect with key=value, comma

I have a pattern of logs that contain performance&statistical data. I have configured LogStash to dissect this data as csv format in order to save the values to ES.
<1>,www1,3,BISTATS,SCAN,330,712.6,2035,17.3,221.4,656.3
I am using the following LogSTash filter and getting the desired results..
grok {
match => { "Message" => "\A<%{POSINT:priority}>,%{DATA:pan_host},%{DATA:pan_serial_number},%{DATA:pan_type},%{GREEDYDATA:message}\z" }
overwrite => [ "Message" ]
}
csv {
separator => ","
columns => ["pan_scan","pf01","pf02","pf03","kk04","uy05","xd06"]
}
This is currently working well for me as long as the order of the columns doesn't get messed up.
However I want to make this logfile more meaningful and have each column-name in the original log. example-- <1>,www1,30000,BISTATS,SCAN,pf01=330,pf02=712.6,pf03=2035,kk04=17.3,uy05=221.4,xd06=656.3
This way I can keep inserting or appending key/values in the middle of the process without corrupting the data. (Using LogStash5.3)
By using #baudsp recommendations, I was able to formulate the following. I deleted the csv{} block completely and replace it with the kv{} block. The kv{} automatically created all the key values leaving me to only mutate{} the fields into floats and integers.
json {
source => "message"
remove_field => [ "message", "headers" ]
}
date {
match => [ "timestamp", "YYYY-MM-dd'T'HH:mm:ss.SSS'Z'" ]
target => "timestamp"
}
grok {
match => { "Message" => "\A<%{POSINT:priority}>,%{DATA:pan_host},%{DATA:pan_serial_number},%{DATA:pan_type},%{GREEDYDATA:message}\z" }
overwrite => [ "Message" ]
}
kv {
allow_duplicate_values => false
field_split_pattern => ","
}
Using the above block, I was able to insert the K=V, pairs anywhere in the message. Thanks again for all the help. I have added a sample code block for anyone trying to accomplish this task.
Note: I am using NLog for logging, which produces JSON outputs. From the C# code, the format looks like this.
var logger = NLog.LogManager.GetCurrentClassLogger();
logger.ExtendedInfo("<1>,www1,30000,BISTATS,SCAN,pf01=330,pf02=712.6,pf03=2035,kk04=17.3,uy05=221.4,xd06=656.3");

Logstash : Unrecognized #timestamp value, setting current time to #timestamp, original in _#timestamp field

I have json log messages sent to logstash which looks like :
{"#timestamp":"2017-08-10 11:32:14.619","level":"DEBUG","logger_name":"application","message":"Request processed in 1 ms"}
And logstash configured with :
json {
source => "message"
}
date {
match => ["#timestamp", "yyyy-MM-dd HH:mm:ss.SSS"]
timezone => "Europe/Paris"
}
But I have this warning in the logs :
[2017-08-10T11:21:16,739][WARN ][logstash.filters.json ] Unrecognized #timestamp value, setting current time to #timestamp, original in _#timestamp field {:value=>"\"2017-08-10 11:20:34.527\""}
I tried different configurations, like adding quotes around the space, renaming the field with a mutate before the date filter (wich result with the same warning, and an error saying that the timestamp is missing), etc...
In the values stored in elastic search, the timestamp is the time the log was parsed and not the original (2/3 seconds after).
What am I missing ?
I think the problem is that the field in the source message is named #timestamp, just like the default.
We solved it by renaming the field in the source, add changing the config to :
date {
match => ["apptimestamp", "yyyy-MM-dd HH:mm:ss.SSS"]
timezone => "Europe/Paris"
}

logstash convert time to date time

I am using logstash to push data from filebeat to elasticsearch. My data has time as hh:mm:ss a (05:21:34 AM). I want to add today's date to it.
This is filter of logstash config
filter{
grok{ some grok pattern to get time}
date {
locale => "en"
match => ["time", "hh:mm:ss a"]
target => "#timestamp"
}
}
But data converted as 2016-01-01T05:21:34.000Z
How can I change it to 2016-10-14T05:21:34.000Z?
I think logstash is smart enough to use the current year (as you're seeing), but it's not defaulting the other fields.
You should make a new field with the full datetime string you want. Something like this should work between your grok and date:
grok { }
mutate {
add_field => { "datetime" => "%{+YYYY.MM.dd} %{time}" }
}
date { }
Be sure to change your date{} pattern to use the new datetime field and its format. If you don't want the datetime field after date{} is called, you can either use a metadata field instead, or remove_field as part of date{}.

How to add a new dynamic value(which is not there in input) to logstash output?

My input has timestamp in the format of Apr20 14:59:41248 Dataxyz.
Now in my output i need the timestamp in the below format:
**Day Month Monthday Hour:Minute:Second Year DataXYZ **. I was able to remove the timestamp from the input. But I am not quite sure how to add the new timestamp.
I matched the message using grok while receiving the input:
match => ["message","%{WORD:word} %{TIME:time} %{GREEDYDATA:content}"]
I tried using mutate add_field.but was not successful in adding the value of the DAY. add_field => [ "timestamp","%{DAY}"].I got the output as the word ´DAY´ and not the value of DAY. Can someone please throw some light on what is being missed.
You need to grok it out into the individual named fields, and then you can reference those fields in add_field.
So your grok would start like this:
%{MONTH:month}%{MONTHDAY:mday}
And then you can put them back together like this:
mutate {
add_field => {
"newField" => "%{mday} %{month}"
}
}
You can check with my answer, I think this very helpful to you.
grok {
match => { "message" => "%{TIMESTAMP_ISO8601:time} \[%{NUMBER:thread}\] %{LOGLEVEL:loglevel} %{JAVACLASS:class} - %{GREEDYDATA:msg}" }
}
if "Exception" in [msg] {
mutate {
add_field => { "msg_error" => "%{msg}" }
}
}
You can use custom grok patterns to extract/rename fields.
You can extract other fields similarly and rearrange/play arounnd with them in mutate filter. Refer to Custom Patterns for more information.

Logstash date filter in a special case

I have the following format for my logs:
201407022000.log:2014-07-02 20:00;10.112.64.250;3;972819;ULC Primeline
Since it's a csv I could split the first parts pretty easily like this:
csv {
columns => ["fulldate","ip","port","electricity","customer"]
separator => ";"
remove_field => "message"
}
Now I want to split my fulldate field into a "whatever comes before the date" ( 201407022000.log: ) and the actual date field ( 2014-07-02 20:00 )
I tried to use the date filter like this:
date {
match => [ "fulldate", "YYYY-MM-dd HH:mm" ]
timezone => "Europe/Berlin"
}
I receive the following error:
Failed parsing date from field {:field=>"date",
:value=>"201407022000.log:2014-07-02 20:00",
:exception=>java.lang.IllegalArgumentException: Invalid format:
"201407022000.log:2014-07-02 20:00" is malformed at
"000.log:2014-07-02 20:00", :level=>:warn}
Unfortunately this is not working logstash fails parsing.
The reason it is failing is that you are trying to parse this:
201407022000.log:2014-07-02 20:00
With a filter that would match the format:
"YYYY-MM-dd HH:mm"
What you could do would be to use a grok on that field before parsing it:
filter {
grok {
match => { "fulldate" => "[0-9.]+log:%{TIMESTAMP_ISO8601:date}" }
}
}
If you also want to capture the filename at the start, you could create a new pattern like (that would go in a file in your pattern directory, normally /opt/logstash/patterns in UNIX based systems) this:
LOGFILENAMEPATTERN [0-9.]+log
Then your grok would become:
filter {
grok {
match => [ "fulldate" => "%{LOGFILENAMEPATTERN:filename}:%{TIMESTAMP_ISO8601:date}" ]
}
}
Finally your date pattern would become:
date {
match => [ "date", "yyyy-MM-dd HH:mm" ]
timezone => "Europe/Berlin"
}
Note, that I have changed the name of the field you are matching against, as I renamed it in the grok, and I have replaced YYYY with yyyy, as Y is Year of Era, y is year, they are not the same (according to the documentation)

Resources