Reputation: 11
I have to analyze log files of a software which has run out of support (the software provider has ceased) to trace down an issue. Unfortunately, the log data is not stored in plain (human-readable) text and there exists no documentation on how to interpret the log files. That is why I would like to develop an algorithm to make the data human-readable - preferrably in Java or Python, but in the first step I need to understand the data structure.
Please find below the information I have collected so far and what I have already tried:
notepad++
& HxD
:LogFiles
and have filetype .log
. At each application start, a new log file is created which is used until termination of the application. The logging process is not the same as the application process but runs in parallel. It is a native process (not managed).01 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
(hex extracted with HxD
)a-z, A-Z, 0-9, +, /, =
)y+O+srwgzR
/ decoded hex CB E3 BE B2 BC 20 CD 12
.=
is used. For all these files, =
only occurs right before a string of 32 characters at the end of the file. These last 16 characters are also from the Base64 alphabet.After decoding, I checked the result string but it was a (not human-readable) string of seemingly random characters for all cases.
gzip
glib
Both approaches failed with an exception (e.g., for zlib
: Incorrect header check
) indicating that the decoded data is not compressed. Eventhough I would not expect log file data to be encrypted, I have no clue how to rule that out.
Netstat
did not reveal any ports of one of both processes, for STraceNT
and accesschk
I did not know how to manage all the data retrieved to make conclusions about shared memory use. I have to admit that I am not used to these tools and would need more guidance to use them if it makes sense for my problem.I'm thankful for any ideas on how to develop my assumptions further and on how to find a way to make the log-data human readable. If you need any more details, let me know!
Upvotes: 0
Views: 430