kkgarg
kkgarg

Reputation: 1376

Retrieving full text using twarc2

I am retrieving tweets using twarc2 with search terms in the following way:

twarc2 search --archive --start-time "2015-01-01" --end-time "2018-12-31" --limit 25000 "faith OR #faith" results.jsonl

But the resultant tweets are truncated after a certain length. E.g. RT @AndrewYNg: We cannot abdicate responsibility when two children, ages 7 and 8, die in US custody. The US once said: "Give me your tired,… although the tweet is a bit longer. I read the twarc2 documentation but can't find any "extended" tweet_mode option for retrieving the full_text. Any help on this will be appreciated.

Upvotes: 0

Views: 198

Answers (1)

script_kitty
script_kitty

Reputation: 771

Are all the retrieved tweets truncated? The example you provided is a retweet (includes "RT"). RT are truncated but their original tweet is full text.

You can exclude retweets in twarc2. Try adding the below to your command:

-is:retweet 

Hope that helps.

Upvotes: 0

Related Questions