I tried letting "Claude Code" solve a coding problem for me.
It was iterating, trying edits, running the tests, revising the code. But then when it couldn't get the tests pass to after a couple of iterations, it updated the test suite to just skip those tests, declaring that they were unreliable and not useful. π
The tests were passing before Claude got involved. Our jobs are safe for now.