Data Scraping: Do Large Language Models Cross Boundaries by Training on Content from Everyone

by Viggy Balagopalakrishnan | 10 min read | August 8th, 2023

Too Long; Didn't Read

While scraping enabled models to get where they are, cleanly sourced data is going to become more and more important.

Viggy Balagopalakrishnan

@viggybala

Product person at heart. Writing weekly in-depth analyses of tech/business topics at thisisunpacked.substack.com.

STORY’S CREDIBILITY

Original Reporting

This story contains new, firsthand information uncovered by the writer.

Opinion piece / Thought Leadership

This is an opinion piece based on the author’s POV and does not necessarily reflect the views of HackerNoon.
