Hacker News
new
|
past
|
comments
|
ask
|
show
|
jobs
|
submit
login
sedatk
on Jan 17, 2025
|
parent
|
context
|
favorite
| on:
Nepenthes is a tarpit to catch AI web crawlers
Both ChatGPT 4o and Claude 3.5 Sonnet can identify the generated page content as "random words".
tlonny
on Jan 17, 2025
[–]
Given the size of the training data - I don’t think it would economical to validate all training data with high-end LLM models.
sedatk
on Jan 17, 2025
|
parent
[–]
True. Maybe it can be dumbed down to a low-end model specifically for this type of detection.
Guidelines
|
FAQ
|
Lists
|
API
|
Security
|
Legal
|
Apply to YC
|
Contact
Search: