Trying to count specific strings from a .txt file (python)

Question

I have an input file with entries like this:

(observatório=astronómico, de o, universidade=de=coimbra)
(centro=de=astronomia, de o, universidade=do=porto=catarina=lobo)
(núcleo=interactivo=de=astronomia, em o,    centro=de=interpretação=ambiental=da=ponta=do=sal)
(câmara=municipal, de, cascais)
(câmara, de, nova=iorque)
(presidência, de o, pe)
(fortis, em, bruxelas)
(macquarie=futures, de o, eua)
(força=internacional=de=assistência=e=segurança, constituir o, força=de=reacção=rápida=do=comandante)
(forças=nacionais=destacadas, em o, afeganistão)
(nato, em o, afeganistão)
(nato, em o, afeganistão)

I need to count how many times each string repeats and write the counts, in order, to another .txt file. I did it using a dict, but it was frustrating to .strip the special characters.

#!/usr/bin/python
# -*- coding: utf-8 -*-
from Tkinter import Tk
from tkFileDialog import askopenfilename

Tk().withdraw()  # hide the empty root window; only the file dialog is needed
filename = askopenfilename()
file = open(filename, "r+")
wordcount = {}
# split() breaks the text on whitespace, so each entry is cut into separate words
for word in file.read().split():
    if word not in wordcount:
        wordcount[word] = 1
    else:
        wordcount[word] += 1
for k, v in wordcount.iteritems():
    print k, "=", v, "vez(es)"

Any tips on how I can count them properly and write the output so that anyone can read it and see how many times each string (which can be a whole line, given the entry format) occurred?

Answers (1)
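
One possible approach, sketched under a few assumptions rather than offered as the definitive solution: treat each line as one entry instead of splitting on whitespace, so there is no punctuation to strip, and let collections.Counter do the counting. The function names (count_lines, write_report, count_file) are made up for this illustration. The sketch assumes the file is UTF-8 encoded and that entries differing only in spacing should count as the same entry. It is written for Python 3, and because it uses io.open it should also run on Python 2.7.

```python
# -*- coding: utf-8 -*-
import io
from collections import Counter


def count_lines(lines):
    """Count identical entries, skipping blank lines."""
    counts = Counter()
    for line in lines:
        # Collapse runs of whitespace so that "em o,    centro" and
        # "em o, centro" are counted as the same entry (an assumption).
        entry = " ".join(line.split())
        if entry:
            counts[entry] += 1
    return counts


def write_report(counts, out_path):
    """Write one 'entry = N vez(es)' line per entry, most frequent first."""
    with io.open(out_path, "w", encoding="utf-8") as out:
        for entry, n in counts.most_common():
            out.write(u"{0} = {1} vez(es)\n".format(entry, n))


def count_file(in_path, out_path):
    """Read in_path (assumed UTF-8), count its lines, write the report."""
    with io.open(in_path, "r", encoding="utf-8") as src:
        counts = count_lines(src)
    write_report(counts, out_path)
    return counts
```

With the file dialog from the question, this could be called as count_file(filename, "contagem.txt"), where the output name is only a placeholder. For the sample input, the entry (nato, em o, afeganistão) appears twice, so it would be listed first with a count of 2.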
