Too Long; Didn't Read
New York City taxi rides are probably the most commonly used benchmark in the area of data analytics. Data collection constitutes 1.7 billion taxi and for-hire vehicle (Uber, Lyft, etc.) trips originating in NYC since 2009. The data collection record includes a lot of different attributes of a taxi ride: Pickup date and time, pickup date and dropoff location names, wind speed, snow depth and tip amount. It can be used mostly for testing queries, but it also includes a couple of full-text fields that can also be used to test free text capabilities of databases.