Too Long; Didn't Read
Piotr Orzeszek is a self-motivated research engineer with experience in data science and serverless machine learning. We compare three Python libraries: Newspaper, Goose3 and news-please. We expect that libraries with fewer dependencies and smaller memory requirements will behave better in news extraction performance and bootstrapping time. The research relies on Python Cloud Importer, a solution for importing libraries directly from cloud storage and automating package optimization. The importer was developed as a part of the Cloud AI Operating System project overseen by BST LABS, the software engineering unit of BlackSwan Technologies.