AI Coding BATTLE | Which Open Source Model is BEST?

Introduction

In the realm of local coding without internet access, three open-source coding models are being put to the test to determine which one reigns supreme. The contenders are Deep Sea Coder V2, Yoder 9B, and Quen 2.5 Coder 7B, each boasting similar sizes and capabilities. This article explores their performance across a series of programming challenges, all while utilizing a powerful Dell Precision 5860 equipped with dual Nvidia RTX A6000 GPUs, allowing for a remarkable 96 GB of VRAM.

Initial Setup and Testing Environment

The testing environment is set up with all three models loaded simultaneously, providing a unique opportunity to compare their performance in real-time. The first task involves creating a simple Snake game in Python, showcasing each model's efficiency and coding abilities.

Deep Sea Coder V2 Performance

Starting with Deep Sea Coder V2, the model successfully generates code using the tkinter library for the Snake game. It performs at about 30 tokens per second. However, the output requires adjustments to work properly, leading to a less than satisfactory experience.

Yoder 9B Performance

Next, Yoder 9B is tested, which shows an increase in speed at 50 tokens per second and utilizes the turtle library. This version manages to execute correctly, although it experiences minor graphical alignment issues. Nevertheless, it offers a more functional experience than the first model.

Quen 2.5 Coder 7B Performance

Finally, Quen 2.5 Coder 7B demonstrates superior speed at nearly 70 tokens per second and successfully employs pygame for better performance. It creates a fully functional Snake game that manages errors effectively when the player collides with the snake’s own body.

Tetris Challenge

The models are then tested again with a more complex challenge of creating the Tetris game. Unfortunately, none of the models succeeded; Deep Sea Coder V2 struggled with missing library references, while both Yoder 9B and Quen 2.5 Coder 7B produced non-functioning code.

Coding Challenges with CodeWars

Shifting focus to coding challenges from CodeWars, the models are queried with difficulties ranging from straightforward tasks, like moving letters in a string, to complex issues like generating prime numbers. Interestingly, all three models successfully completed the simpler task but failed to meet time constraints for the more complex challenges, demonstrating performance limitations under heavy workloads.

Median of Two Sorted Arrays

The concluding challenge involves calculating the median of two sorted arrays, a task all three models handled effectively after some adjustments.

Conclusion

In a showdown among the three coding models, Quen 2.5 Coder 7B stands out for its speed and capability, particularly excelling at the Snake game. While all models showcased some strengths in simpler tasks, no one was consistent enough to handle complex challenges within time limits.

Ultimately, if users are seeking an efficient tool for local coding without internet reliance, Quen 2.5 offers a compelling choice, especially when combined with a capable machine like the Dell Precision 5860.

Keywords

AI, coding models, Deep Sea Coder V2, Yoder 9B, Quen 2.5 Coder 7B, local coding, Snake game, Tetris, CodeWars, prime number generator, machine learning, programming challenges.

FAQ

Q: What are the three open-source coding models being tested?
A: The three models are Deep Sea Coder V2, Yoder 9B, and Quen 2.5 Coder 7B.

Q: Which model performed the best in creating the Snake game?
A: Quen 2.5 Coder 7B performed the best, executing the game efficiently and accurately.

Q: Did any models succeed in creating the Tetris game?
A: No, all three models failed to create a functional Tetris game.

Q: What coding challenges were explored in the article?
A: The models were tested with challenges from creating a Snake game, Tetris, moving letters in a string, generating prime numbers, and finding the median of two sorted arrays.

Q: Which model is recommended for local coding without internet?
A: Quen 2.5 Coder 7B is recommended for its speed and functionality in local coding scenarios.