Both ChatGPT 4o and Claude 3.5 Sonnet can identify the generated page content as... | Hacker News

Hacker Newsnew | past | comments | ask | show | jobs | submit

		sedatk on Jan 17, 2025 \| parent \| context \| favorite \| on: Nepenthes is a tarpit to catch AI web crawlers Both ChatGPT 4o and Claude 3.5 Sonnet can identify the generated page content as "random words".

tlonny on Jan 17, 2025 [–]

Given the size of the training data - I don’t think it would economical to validate all training data with high-end LLM models.

sedatk on Jan 17, 2025 | [–]

True. Maybe it can be dumbed down to a low-end model specifically for this type of detection.

Guidelines | FAQ | Lists | API | Security | Legal | Apply to YC | Contact