vrepsys

Reputation: 2213

Why does repeated JSON parsing consume more and more memory?

It seems that parsing the same JSON file over and over again in Ruby consumes more and more memory. Consider the code and output below:

  1. Why isn't the memory freed up after the first iteration?
  2. Why does a 116MB JSON file take up 1.5GB of RAM after parsing? It's surprising considering the text file is converted into hashes. What am I missing here?

Code:

require 'json'

# Resident set size (RSS) of the current process, in MB
def memused
  `ps ax -o pid,rss | grep -E "^[[:space:]]*#{$$}"`.strip.split.map(&:to_i)[1] / 1024
end

text = IO.read('../data-grouped/2012-posts.json')
puts "before parsing: #{memused}MB"
iter = 1
while true
  items = JSON.parse(text)
  GC.start
  puts "#{iter}: #{memused}MB"
  iter += 1
end

Output:

before parsing: 116MB
1: 1840MB
2: 2995MB
3: 2341MB
4: 3017MB
5: 2539MB
6: 3019MB

Upvotes: 12

Views: 2902

Answers (1)

Thiago Lewin

Reputation: 2828

When Ruby parses a JSON file, it creates many intermediate objects along the way. These objects stay in memory until the GC starts working.

If the JSON file has a complicated structure, with many arrays and nested objects, the number of intermediate objects grows quickly as well.

Did you try calling GC.start to suggest that Ruby clean up unused memory? If the amount of memory decreases significantly, that suggests it was mostly intermediate objects used to parse the data; otherwise, your data structure is complex or there is something in your data that the library can't deallocate.
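
For instance, a minimal check along those lines (a sketch reusing the memused helper and file path from the question) might look like this:

require 'json'

# Resident set size (RSS) of the current process, in MB (same helper as in the question)
def memused
  `ps ax -o pid,rss | grep -E "^[[:space:]]*#{$$}"`.strip.split.map(&:to_i)[1] / 1024
end

text = IO.read('../data-grouped/2012-posts.json')  # path taken from the question

items = JSON.parse(text)
puts "after parse:     #{memused}MB"

GC.start
puts "after GC.start:  #{memused}MB"   # a large drop here points at intermediate parser objects

items = nil                            # release the reference to the parsed hashes/arrays
GC.start
puts "after releasing: #{memused}MB"   # measure again once the parsed data itself is collectable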

For large JSON processing I use yajl-ruby (https://github.com/brianmario/yajl-ruby). It is implemented in C and has a low memory footprint.
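
A rough sketch of parsing straight from the file with yajl-ruby (assuming the gem is installed and using the file path from the question) could look like:

require 'yajl'

# Parsing from an IO avoids holding the entire JSON text in a Ruby String
# alongside the parsed result.
items = File.open('../data-grouped/2012-posts.json', 'r') do |file|
  Yajl::Parser.parse(file)
end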

Upvotes: 4
