

58·
17 days agono u


no u


Good grief, the irrational ai kneejerk hate in this community is insane. This seems like a perfect use case - a code base with good test coverage and well defined output expectations, where a human has guided the translation and checked the results. The human in question has saved a lot of time. And still all the comments are “hurr durr slop amirite”. SMH fucking head.
GLM 4.5 is from August. Isn’t the real tl;dr that a seven month old open model, which was behind proprietary models at the time, did better than most humans would?