Unfortunately, it’s difficult to tell. The existing products aren’t widely available to use and test. The most widely used benchmark, SWE-Bench Lite, has real-world issues and patches from GitHub. However, the patches in SWE-Bench Lite only changes 10 lines of code per patch! This rises to 37 lines of code for the full SWE-Bench, but neither are reflective of complex software engineering tasks.
To understand the limits of AI for software development, we’ve been exploring how AI can be used for code using real-world tasks. In this blog post, we’ll describe how we used AI to convert (a part) of a popular Python library to C++ for speed improvements.
What we’ve found is that AI is quite good at writing large volumes of code, but it suffers from precision. Unfortunately (or fortunately?), AI isn’t here to replace humans yet. Read on to see why! And connect with us if you’re interested in fast Python or understanding AI and code.
Python is one of the most widely used languages - and for good reason. It’s extremely flexible, which allows for fast development. However, it’s also extremely slow. C++ is nearly the opposite. It allows fine-grained control of memory and the final compiled code. Unfortunately, it’s considered difficult to use.