Ruby data formatting

Question

I'm reading a log file and trying to organize the data in the below format, so I wanted to push NAME(i.e USOLA51, USOLA10..) as hash and create corresponding array for LIST and DETAILS. I've created the hash too but not sure how to take/extract the corresponding/associated array values.

Expected Output

NAME           LIST             DETAILS

USOLA51        ICC_ONUS         .035400391
               PA_ONUS          .039800391
               PA_ONUS          .000610352

USOLA10        PAL               52.7266846
              CFG_ONUS           15.9489746
likewise for the other values

Log file:

--- data details ----

USOLA51

ONUS                    size
------------------------------ ----------
ICC_ONUS               .035400391
PA_ONUS            .039800391
PE_ONUS            .000610352

=========================================


---- data details ----


USOLA10


ONUS                    size
------------------------------ ----------
PAL                52.7266846
CFG_ONUS               15.9489746


=========================================

---- data details ----


USOLA55


ONUS                    size
------------------------------ ----------
PA_ONUS            47.4707031
PAL              3.956604
ICC_ONUS               .020385742
PE_ONUS            .000610352


=========================================


---- data details ----

USOLA56

ONUS                    size
------------------------------ ----------

=========================================

what I've tried

unique = Array.new
owner = Array.new
db = Array.new
File.read("mydb_size.log").each_line do |line|
  next if line =~ /---- data details ----|^ONUS|---|=======/   
  unique << line.strip if line =~ /^U.*\d/ 

end

hash = Hash[unique.collect { |item| [item, ""] } ]

puts hash

Current O/p

{"USOLA51"=>"", "USOLA10"=>"", "USOLA55"=>"", "USOLA56"=>""}

Any help to move forward would be really helpful here.Thanks !!

wiesion · Accepted Answer

It's been a long time i've been working with ruby, so probably i forgot a lot of the shortcuts and syntactic sugar, but this file seems to be easily parseable without great efforts.

A simple line-by-line comparison of expected values will be enough. First step is to remove all surrounding whitespaces, ignore blank lines, or lines that start with = or -. Next if there is only one value, it is the title, the next line consists of the column names, which can be ignored for your desired output. If either title or column names are encountered, move on to the next line and save the following key/value pairs as ruby key/value pairs. During this operation also check for the longest occurring string and adjust the column padding, so that you can generate the table-like output afterwards with padding.

# Set up the loop
merged = []
current = -1
awaiting_headers = false
columns = ['NAME', 'LIST', 'DETAILS']
# Keep track of the max column length
columns_pad = columns.map { |c| c.length }

str.each_line do |line|
  # Remove surrounding whitespaces, 
  # ignore empty or = - lines
  line.strip!
  next if line.empty?
  next if ['-','='].include? line[0]
  # Get the values of this line
  parts = line.split ' '
  # We're not awaiting the headers and 
  # there is just one value, must be the title
  if not awaiting_headers and parts.size == 1
    # If this string is longer than the current maximum
    columns_pad[0] = line.length if line.length > columns_pad[0]
    # Create a hash for this item
    merged[current += 1] = {name: line, data: {}}
    # Next must be the headers
    awaiting_headers = true
    next
  end
  # Headers encountered
  if awaiting_headers
    # Just skip it from here
    awaiting_headers = false
    next
  end
  # Take 2 parts of each (should be always only those two) 
  # and treat them as key/value
  parts.each_cons(2) do |key, value|
    # Make it a ruby key/value pair
    merged[current][:data][key] = value 
    # Check if LIST or DETAILS column length needs to be raised
    columns_pad[1] = key.length if key.length > columns_pad[1]
    columns_pad[2] = value.length if value.length > columns_pad[2]
  end
end

# Adding three spaces between columns
columns_pad.map! { |c| c + 3}  

# Writing the headers
result = columns.map.with_index { |c, i| c.ljust(columns_pad[i]) }.join + "
"

merged.each do |item|
  # Remove the next line if you want to include empty data
  next if item[:data].empty?  
  result += "
"
  result += item[:name].ljust(columns_pad[0])
  # For the first value in data, we don't need extra padding or a line break
  padding = ""
  item[:data].each do |key, value|
    result += padding
    result += key.ljust(columns_pad[1])
    result += value.ljust(columns_pad[2])
    # Set the padding to include a line break and fill up the NAME column with spaces
    padding = "
" + "".ljust(columns_pad[0])
  end
  result += "
"
end

puts result

Which will result in

NAME      LIST       DETAILS      

USOLA51   ICC_ONUS   .035400391   
          PA_ONUS    .039800391   
          PE_ONUS    .000610352   

USOLA10   PAL        52.7266846   
          CFG_ONUS   15.9489746   

USOLA55   PA_ONUS    47.4707031   
          PAL        3.956604     
          ICC_ONUS   .020385742   
          PE_ONUS    .000610352

Online demo here

Ruby data formatting

Answers (2)

Related Questions