Reputation: 6605
Hi, is there an efficient way to tag parts of speech in very large files?
import pandas as pd
import collections
import nltk

# word_tokenize expects a string, not a DataFrame; join the text column
# first (assuming the text lives in a column named "text" -- adjust as needed)
text = " ".join(pandas_dataframe["text"].astype(str))
tokens = nltk.word_tokenize(text)
tag1 = nltk.pos_tag(tokens)
counts = collections.Counter(tag for _, tag in tag1)  # Counter, not counter
I am trying to find the most common parts of speech in a file, and I don't know of a better way of doing this.
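Since the whole file never has to be in memory at once, one option is to stream it line by line and keep only a running tally. This is a minimal sketch; `count_pos` and the toy tagger are illustrative names, not NLTK API, and with NLTK you would pass `lambda s: nltk.pos_tag(nltk.word_tokenize(s))` as the tagger.

```python
import collections

def count_pos(lines, tag):
    """Stream lines through a tagger, keeping only a running Counter.

    `tag` is any callable mapping a string to (token, tag) pairs;
    with NLTK it would be lambda s: nltk.pos_tag(nltk.word_tokenize(s)).
    """
    counts = collections.Counter()
    for line in lines:
        counts.update(t for _, t in tag(line))
    return counts

# Toy rule-based tagger so the sketch runs without NLTK's model files:
toy_tag = lambda s: [(w, "VBG" if w.endswith("ing") else "NN")
                     for w in s.split()]

print(count_pos(["running dog", "sleeping cat"], toy_tag).most_common())
```

Reading the file with `for line in open(path)` (or a pandas `read_csv(..., chunksize=...)` iterator) keeps memory flat regardless of file size.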
Upvotes: 0
Views: 286
Reputation: 431
Typically you need to work around three things: the Python-level for loop, potentially high memory load, and potentially high CPU load.
Here's an example of distributed part-of-speech tagging using Python and execnet.
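The linked execnet recipe isn't reproduced here, but the same idea can be sketched with only the standard library: fan the lines out across worker processes, tag and count inside each worker, and merge the partial Counters. The tagger below is a stand-in so the sketch runs without NLTK's model files; real code would call `nltk.pos_tag(nltk.word_tokenize(line))` in the worker, and the function names are illustrative.

```python
import collections
import multiprocessing

def tag_and_count(line):
    # Runs in a worker process. Stand-in tagger; in real code this
    # would be nltk.pos_tag(nltk.word_tokenize(line)).
    tagged = [(w, "VBG" if w.endswith("ing") else "NN")
              for w in line.split()]
    return collections.Counter(t for _, t in tagged)

def parallel_pos_counts(lines, processes=4):
    # Distribute lines to the pool and fold the partial Counters
    # together, so no process ever holds all the tagged tokens.
    total = collections.Counter()
    with multiprocessing.Pool(processes) as pool:
        for partial in pool.imap_unordered(tag_and_count, lines,
                                           chunksize=1000):
            total.update(partial)
    return total

if __name__ == "__main__":
    print(parallel_pos_counts(["running dog", "sleeping cat"]).most_common())
```

Because only small per-line Counters cross process boundaries, this spreads the CPU load without blowing up memory; swapping the pool for execnet gateways gives the distributed version the answer refers to.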
Upvotes: 1