The Movie Corpus The Movies Corpus contains 200 million words of data in more than 25,000 movies from the 1930s to the current time. All of the 25,000+ movies are tied in to their IMDB entry, which means that you can create Virtual Corpora using extensive metadata — year, country, rating, genre, etc.
The TV Corpus The TV Corpus contains 325 million words of data in 75,000 TV episodes from the 1950s to the current time. All of the 75,000 episodes are tied in to their IMDB entry, which means that you can create Virtual Corpora using extensive metadata — year, country, series, rating, genre, etc.
List: 5 Successful People Take Us Through Their Morning Routines – McSweeney’s Internet Tendency 5 – 6 AM: I wake up and scream. I reflect on the ants that have infected my bed and also my dreams. I write their messages in my journal and decide on the three hot dogs I will eat that day. Then I go into the bathroom where the ants have organized my magazines. I read a New Yorker poem out loud to no one.