Have you ever looked at the impressive results that LLMs get on benchmarks or difficult exams and wondered if they're everything they seem? Some of these results are due to something called data leakage, where assessment questions leak into the training data and the models simply memorise the answers.
If you'd like to learn more about how this classic machine learning pitfall calls many reported LLM performance results into question, check out my latest blog post.
https://t-redactyl.io/posts/2025-12-30-data-leakage-llm-measurement/
