paint-brush
Collecting Data from 1.1M Hacker News Curated Commentsby@snikolaev
313 reads
313 reads

Collecting Data from 1.1M Hacker News Curated Comments

by Sergey Nikolaev12mJune 10th, 2022
Read on Terminal Reader
Read this story w/o Javascript
tldt arrow

Too Long; Didn't Read

In this test we use the data collection of 1.1M Hacker News curated comments with numeric fields from <https://://://zenodo.org/record/45901. In the modern world 1 million of documents can be considered a very small data set which can be typical for many applications: blogs and news sites, online stores, job, automotive and real estate sites and so on. We have made this test available for 4 databases: Clickhouse, Manticore Search, Elasitcsearch, Clickhouse and Elasticsearch. We've tried to make as little changes to database default settings as possible.

Company Mentioned

Mention Thumbnail
featured image - Collecting Data from 1.1M Hacker News Curated Comments
Sergey Nikolaev HackerNoon profile picture
Sergey Nikolaev

Sergey Nikolaev

@snikolaev

Database expert. Passionate about databases and search engines.

About @snikolaev
LEARN MORE ABOUT @SNIKOLAEV'S
EXPERTISE AND PLACE ON THE INTERNET.
L O A D I N G
. . . comments & more!

About Author

Sergey Nikolaev HackerNoon profile picture
Sergey Nikolaev@snikolaev
Database expert. Passionate about databases and search engines.

TOPICS

THIS ARTICLE WAS FEATURED IN...

Permanent on Arweave
Read on Terminal Reader
Read this story in a terminal
 Terminal
Read this story w/o Javascript
Read this story w/o Javascript
 Lite