1 min readfrom Machine Learning

Just did an analysis on ICLR 2025 vs 2026 scores and WOW [D]

Just did an analysis on ICLR 2025 vs 2026 scores and WOW [D]
Just did an analysis on ICLR 2025 vs 2026 scores and WOW [D]

Per https://paperreview.ai/tech-overview, the scores corr between 2 human is about 0.41 for ICLR 2025, but in my current project I am seeing a much lower corr for ICLR 2026. So I ran the metrics for both 2025 and 2026 and it is crazy. I used 2 metrics, one-vs-rest corr and half-half split corr. All data are fetched from OpenReview.

I do know that top conf reviews are just a lottery now for most papers, but i nenver thought it is this bad.

2025 avg-score SD: 1.253, mean wavg-scoreer human SD: 1.186

2026 avg-score SD: 1.162, mean within-paper human SD: 1.523

https://preview.redd.it/klay6nijipug1.png?width=2090&format=png&auto=webp&s=92c85470bc72ff03584f38f160d3d09f530b55e2

  • 2025 avg-score SD: 1.253, mean within-paper human SD: 1.186
  • 2026 avg-score SD: 1.162, mean within-paper human SD: 1.523
submitted by /u/Striking-Warning9533
[link] [comments]

Want to read more?

Check out the full article on the original site

View original article

Tagged with

#generative AI for data analysis
#Excel alternatives for data analysis
#natural language processing for spreadsheets
#conversational data analysis
#data analysis tools
#rows.com
#big data management in spreadsheets
#real-time data collaboration
#intelligent data visualization
#data visualization tools
#enterprise data management
#big data performance
#data cleaning solutions
#ICLR
#scores
#correlation
#2025
#2026
#human SD
#average score