Tell me who your followers are and I'll tell you who you are.
You have lots of followers, congrats! You're popular, maybe even an influencer. But do you actually know who's following you?
If you have thousands of followers, that could be a tricky question to answer. Let's use Machine Learning to try to answer that question.
MonkeyLearn's Twitter account, as of today (Oct 4th, 2016), has almost 20,000 followers. It would be great to know what kind of people follow us, find out more about their interests, who they are, what they do, and try to figure out why they follow us. That could give us useful insights about our user base and product.
We stored all the data in a simple CSV file. The first column holds the user handle and the second column holds the user bio. Like this:
|edp3rez||Actively seeking to become wiser every day and attract people with a similar mindset so we can find ways to build a new world. Is that you?|
|FintechArticle||Fintech Articles and News Analysis from around the web|
|MFENOGLIO||Don't forget to stop and have fun from time to time . Si luchas por lo que realmente quieres vendra a ti. A buenas el mejor|
|michaelyoungMBN||Founder/CEO @ https://t.co/vduvBZ6wHZ @mbnsolutions - Managing Partner @ https://t.co/OYk3xwFeyn @mbnconsilium - Founder of @DataSciTechScot|
|brolouiemd||fashionate seasonal web developer, wordpress developer|
|vexipoloxozo||Tнe 2015 Sαle ιѕ нere! yoυ cαɴ ɴow вυy 50,000 Twιттer Followerѕ ғor oɴly $146, Try ιт ɴow! αт https://t.co/pB6BOGQx3T|
|PetroSemeniuk||Developer. Developer. Developer.|
|SaurabhIAm||I am a Techno Freak, A Computer Science Geek, App. Developer, A Web Developer, A Kickass Gamer, and A Painting Artist!|
|krkdev||Developer and Debugger|
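For readers who want to load a file in this layout themselves, here is a minimal sketch. It assumes the file is comma-separated with no header row, and the helper name `read_bios` is made up for illustration; it is not part of the script used in this post.

```python
import csv
import io


def read_bios(csv_file, bio_column=1):
    """Return (handle, bio) pairs from an open CSV file object.

    Assumes the handle is in column 0 and the bio in `bio_column`.
    """
    pairs = []
    for row in csv.reader(csv_file):
        # Skip blank or short rows that have no bio column.
        if len(row) <= bio_column:
            continue
        pairs.append((row[0], row[bio_column]))
    return pairs


# Two rows from the sample above, written out as CSV text.
sample = io.StringIO(
    "krkdev,Developer and Debugger\n"
    "PetroSemeniuk,Developer. Developer. Developer.\n"
)
bios = read_bios(sample)
print(bios[0])
```

With a real file you would pass `open("followers_bios.csv", newline="", encoding="utf-8")` instead of the in-memory sample.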
You can execute the command like this:
python keywords_csv.py -s 4000 -c 1 -k 100 -t <<YOUR TOKEN HERE>> followers_bios.csv
-s option sets the max number of rows (bios) to use. I suggest limiting it to the last 4,000 texts; using many more would take a long time to process.
-c option sets the column number (starting at 0) where the bios are located in the CSV file.
-k option sets the max number of keywords to return.
-t option sets your MonkeyLearn API token.
And lastly, followers_bios.csv is the CSV file where you stored the bios.
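To make the options above concrete, here is a rough sketch of how a command line like this could be parsed with Python's `argparse`. This is not the actual source of keywords_csv.py: the destination names (`sample_size`, `column`, `max_keywords`, `token`, `csv_path`) and the default values are assumptions chosen for illustration.

```python
import argparse


def build_parser():
    """Build a parser mirroring the flags described above (hypothetical)."""
    parser = argparse.ArgumentParser(
        description="Extract keywords from a CSV of bios (illustrative sketch)."
    )
    # Max number of rows (bios) to use; None here means "no limit" (assumed default).
    parser.add_argument("-s", dest="sample_size", type=int, default=None)
    # Zero-based column index where the bios live (assumed default: 0).
    parser.add_argument("-c", dest="column", type=int, default=0)
    # Max number of keywords to return (assumed default: 10).
    parser.add_argument("-k", dest="max_keywords", type=int, default=10)
    # MonkeyLearn API token.
    parser.add_argument("-t", dest="token", required=True)
    # Path to the CSV file with the bios.
    parser.add_argument("csv_path")
    return parser


# Same arguments as the command above, with a placeholder token.
args = build_parser().parse_args(
    ["-s", "4000", "-c", "1", "-k", "100", "-t", "MY_TOKEN", "followers_bios.csv"]
)
print(args.sample_size, args.column, args.max_keywords, args.csv_path)
```

Note that `-c 1` points at the second column, which matches the layout of the CSV file described earlier.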
The keywords returned will be sorted according to their relevance within the texts.
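As a small illustration of that ordering, the sketch below sorts a list of keyword records by relevance and keeps the top k. The record shape (dicts with `keyword` and `relevance` fields) is an assumption, and the scores are made-up placeholder values, not results from our followers.

```python
def top_keywords(keywords, k):
    """Return the k keyword strings with the highest relevance, best first."""
    ranked = sorted(keywords, key=lambda item: item["relevance"], reverse=True)
    return [item["keyword"] for item in ranked[:k]]


# Made-up records for illustration only.
example = [
    {"keyword": "web developer", "relevance": 0.4},
    {"keyword": "data science", "relevance": 0.9},
    {"keyword": "founder", "relevance": 0.7},
]
print(top_keywords(example, 2))
```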
You can even try the same process by copying and pasting the texts into MonkeyLearn's GUI; just go to the API section. Note that this returns only the top 10 keywords.
And the results for the top 100 keywords associated with MonkeyLearn's followers are:
That's great! Definitely what we wanted to see, but we also found some interesting insights:
Hope you enjoyed this quick post. I'd love to hear what insights you find about your own followers!
October 4th, 2016