How can you tell when a data scientist is just showing off (or has too much time on his or her hands)?
One clue might be when they start comparing vocabularies of Shakespeare and today’s (or more accurately, 20 years ago’s) most popular rappers.
That’s exactly what Matt Daniels has done in his effort to reveal who has “the largest vocabulary in hip hop.” It’s actually pretty cool how Daniels did this. He took the first 35,000 words that appear in seven of William Shakespeare’s works, along with the first 35,000 words of Melville’s "Moby Dick," and compared that to each artists’ first 35,000 lyrics.
Some key findings:
Shakespeare used 5,170 different words, compared to Melville’s 6,022 words
- Yes, the Wu-Tang Clan, with 5,895 words, beat Shakespeare
Click here to see Daniels’ full interactive chart
Aesop Rock smashes everybody with 7,392 words used
of rapper vocabulary sizes, along with several breakdowns and data manipulations. And please refer to it when you win a bar argument over who has the bigger vocabulary—Shakespeare or the Wu-Tang Clan.