Toggle navigation
HN
Paper
All
Show
Ask
Jobs
Top stories
Today
Last 7 days
Last months
This year
Stats
Stories by Cynddl
Measuring What Matters: Construct Validity in Large Language Model Benchmarks
1 points
Cynddl
2025-11-11T11:44:07Z
arxiv.org
AI Capabilities May Be Overhyped on Bogus Benchmarks, Study Finds
43 points
Cynddl
2025-11-07T22:55:19Z
gizmodo.com
AI's capabilities may be exaggerated by flawed tests, according to new study
3 points
Cynddl
2025-11-06T15:24:21Z
www.nbcnews.com
Experts find flaws in tests that check AI safety and effectiveness
3 points
Cynddl
2025-11-04T11:28:01Z
www.theguardian.com
Measuring What Matters: Construct Validity in Large Language Model Benchmarks
3 points
Cynddl
2025-11-04T10:34:15Z
oxrml.com
The quiet software tooling Renaissance
3 points
Cynddl
2025-09-01T15:31:20Z
pdx.su
Facial recognition works better in the lab than on the street, researchers show
4 points
Cynddl
2025-08-20T13:18:28Z
www.theregister.com
We Shouldn't Trust Facial Recognition's Glowing Test Scores
2 points
Cynddl
2025-08-18T15:54:00Z
www.techpolicy.press
Training language models to be warm and empathetic makes them less reliable
358 points
Cynddl
2025-08-12T13:32:16Z
arxiv.org
AI's limited understanding of gender puts health equity at risk
4 points
Cynddl
2025-05-21T09:57:21Z
www.oii.ox.ac.uk
Establishing meaningful data access for algorithm audits
1 points
Cynddl
2025-02-17T23:35:08Z
syntheticsociety.oii.ox.ac.uk
Alpha Lyrae: This font 'randomly' pixelates characters in a block of text
1 points
Cynddl
2024-10-16T11:58:24Z
vegaprotocol.github.io
Data anonymity methods and privacy safeguards unfit for modern data
1 points
Cynddl
2024-07-18T09:30:09Z
www.oii.ox.ac.uk
Soundaktor: a vehicle audio system used to simulate engine noise in the cabin
1 points
Cynddl
2021-10-08T12:32:18Z
en.wikipedia.org
Free money, PS4, and train tickets
1 points
Cynddl
2019-12-26T19:01:30Z
rocher.lc
Facebook’s New Cryptocurrency Libra: Not to Be Confused with Libre
2 points
Cynddl
2019-06-19T17:49:42Z
privacyinternational.org
CMPs May Not Be GDPR Compliant
3 points
Cynddl
2019-05-03T10:13:43Z
adexchanger.com
French data protection watchdog fines Google $57M under the GDPR
2 points
Cynddl
2019-01-21T16:18:48Z
techcrunch.com
When the signal is in the noise: Exploiting Aircloak's Diffix anonymization
4 points
Cynddl
2018-04-25T13:46:47Z
cpg.doc.ic.ac.uk
Cambridge Analytica is only the beginning. Should you blame your friends for it?
2 points
Cynddl
2018-03-29T15:50:54Z
cpg.doc.ic.ac.uk
1