Top model scores may be skewed by Git history leaks in SWE-bench
Link: https://github.com/SWE-bench/SWE-bench/issues/465
Discussion: https://news.ycombinator.com/item?id=45214670
Top model scores may be skewed by Git history leaks in SWE-bench
Link: https://github.com/SWE-bench/SWE-bench/issues/465
Discussion: https://news.ycombinator.com/item?id=45214670
If you have a fediverse account, you can quote this note from your own instance. Search https://social.lansky.name/users/hn100/statuses/115187417739958012 on your instance and quote it. (Note that quoting is not supported in Mastodon.)