Reputation: 35724
I've got some monthly binary log files that I'd like to send to logstash (or possibly fluentd).
The issue I'm having is that (to the best of my knowledge) the bin files are not readable by logstash, so I would need to do one of the following.
Which of these options is the best way to read a custom bin file into logstash?
1. I've set up a nodejs-based script that can read a binary file and create a readable text version of the document. It can be run as a CLI tool or as an http service, and it can return only the lines after a set line number. Is it possible to integrate this with logstash directly, or indirectly (in a way that would not require me to rewrite the code)?
2. If not, is rewriting the script as a logstash plugin worthwhile?
3. If option 1 would not work, and option 2 would take too much time to implement, I'm considering generating text versions. Because the resulting documents are several GB in size, I'd like to remove the files, or if possible the parts of a file that have already been read. Is there any way to get feedback from logstash as to what has been read already?
P.S. I'm running on Windows Server, if it makes any difference.
Upvotes: 1
Views: 1410
Reputation: 16362
You threw out a lot of details, so hopefully I have them all straight.
If you have an http service, logstash has an http_poller input that can poll it.
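A minimal sketch of that input, assuming your converter is reachable at a hypothetical endpoint like http://localhost:3000/logs and returns JSON:

input {
  http_poller {
    urls => {
      # hypothetical name and endpoint for the node.js service
      binlog_converter => "http://localhost:3000/logs"
    }
    schedule => { "every" => "60s" }
    codec => "json"
  }
}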
I would not recommend writing a plugin for logstash. Things continue to change too rapidly in that ecosystem.
Creating plain text files is the easiest idea from a logstash perspective. Logstash doesn't tell you explicitly that it has processed a file, but you can look it up in the registry (on unix, a file named ".sincedb*", typically in /var/lib/logstash, which contains the inode number and a byte offset into the file) to see if the file has been 100% processed.
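For example, a file input along these lines (the paths are assumptions, adjust for your layout) pins the registry to a known location so you can compare the recorded offset against the file size:

input {
  file {
    # hypothetical directory for the converted text files
    path => "C:/converted-logs/*.txt"
    start_position => "beginning"
    # put the registry somewhere predictable so you can inspect it
    sincedb_path => "C:/logstash/sincedb"
  }
}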
There are lots of other ways to feed input to logstash, including tcp/udp inputs or brokers (rabbit, redis, etc.) which might fit into your workflow.
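As one hedged example, the logstash side of a redis broker setup could look like this (the list key is a made-up name your node.js script would push to):

input {
  redis {
    host => "localhost"
    data_type => "list"
    # hypothetical key name
    key => "binlog-events"
  }
}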
There may be Windows-related caveats to all of this, of course.
Upvotes: 2
Reputation: 2722
The easiest way would be to convert the binary format into JSON and feed that to logstash, either via a file or some other mechanism, primarily because when you throw JSON at logstash the filter configuration is extremely simple:
filter {
  if [type] == "my_json_type" {
    json {
      source => "message"
    }
  }
}
which will break down the JSON document into fields for you, including documents nested in the JSON. I recommend feeding that over a socket rather than files if we are talking large volumes, as logstash out of the box does not support any sort of notice when a file is "done with". So your input definition could look like:
tcp {
  port => 4567
  type => "my_json_type"
}
This will open a listening socket on port 4567 and treat each received line as, well, a line, and the filter above will then process it as a JSON document. Then in your node.js you can dispose of logs that you've already fed to logstash.
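A minimal node.js sketch of that sending side, assuming logstash is listening on localhost:4567 as configured above (the record fields are made up for illustration):

const net = require('net');

// hypothetical record produced by your binary-to-JSON converter
const record = { time: '2015-06-01T00:00:00Z', level: 'info', msg: 'example entry' };

const socket = net.connect(4567, 'localhost', () => {
  // one JSON document per line; the tcp input turns each line into one event
  socket.write(JSON.stringify(record) + '\n');
  socket.end();
});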
Upvotes: 1