Road to Proto-AGI 2024

Introduction

As we approach the final quarter of 2023, it’s worth pausing to consider the likely trajectory of artificial intelligence, particularly in the context of its potential convergence toward Artificial General Intelligence (AGI). This post will attempt to make probabilistic predictions for the technological and social landscape of AI in 2024. Given the rapid advancements in multimodal capabilities, code interpretation in models like GPT-4, we stand at the brink of what might be a seismic shift in AI capabilities.

The Near Future:

Autonomous AI and Corporate Rivalry

GPT-4, released by OpenAI, serves as an important milestone. It has vision capabilities and can interpret code, two significant steps toward higher-level cognitive abilities. What’s even more intriguing is the notion that OpenAI experimented with long-term memory [1] in its AI models but hasn’t released it officially yet. If all these components are integrated into a unified, autonomous agent, we may be looking at a proto-AGI.

Many seem to have forgotten that German Microsoft CTO leaked details about GPT-4′s capabilities, hinting at its potential for video understanding. [2] This raises an interesting question: Is OpenAI deliberately constraining the true capabilities of their technology?

On the other hand, Google has had ample time to study GPT-4 and will release its own exceptional AI model, tentatively called “Gemini.” It could either be a leap forward in multimodal understanding or a complete disappointment.

Predictions for 2024

The Arrival of Meta’s Llama 3: Meta will release an opensource AI model, Llama 3, with about 90 billion parameters, equivalent to ChatGPT 3.5 and even better at some tasks. This will be a major competitive move against OpenAI’s models.
Google Gemini: Will come in several sizes. Most of them a free to use, as they will get incorporated in Google products anyways. There will be a very big one that you can use as a subscriber (like ChatGPT Plus), and the biggest one will be available with a waiting list and not yet opened to the public. The biggest (closed) one will be at the level of OpenAIs 2024 model, while the one you can subscribe to will be around GPT-4 level, but with more capabilities right from the start.
OpenAI’s Next Model: In the wake of Google’s Gemini (be it a success or a failure), OpenAI will be highly motivated to push the boundaries. Their next model will likely be more agent-like, with long-term memory and task-execution capabilities, although it will still lack the precision and speed of a human agent. Release will be around March +/- 1 month. It will NOT be called GPT-5 or GPT-4.5, since it isn’t a mere GPT anymore. It will use a more catchy name. I call it proto-AGI, since it satisfies some aspects of AGI, but is still less accurate and slower than a median human.
Industrial Humanoid Robots: Several robotic companies will start beta deployments of humanoid robots in industrial settings, augmenting or replacing human labor in certain tasks.
Open Source Catching Up: Open source models will approach the capabilities of DALL-E 3 in text-to-image generation, adding the benefit of being fine-tunable.
Text-to-Video Advances: We will see significant improvements in text-to-video technologies, particularly in narrow domains (poor generalization still, good quality and specific domains)
Rise of Synthetic Datasets: AI training will increasingly rely on large, synthetic datasets, although these may contain biases that are harder to detect and correct.
Medical Breakthrough: AI will contribute to at least one major breakthrough in the medical field.
Public Sentiment: With AI becoming pervasive, public sentiment will grow more polarized. Anti-AI groups will start to form, though the majority of people will embrace the conveniences offered by AI.
AI in Gaming: Indie games will begin to incorporate AI components like conversational NPCs, although AAA games will not adopt this trend just yet.

Counterarguments and Uncertainties While these predictions are based on current trends and reasonable extrapolations, it’s important to note that the AI field is highly uncertain. Market pressures, ethical considerations, and unforeseen technical challenges could all deviate the course.

Conclusion The year 2024 promises to be a significant year for AI, possibly setting the stage for even more radical changes. While we can’t predict the future with certainty, it’s important to rationally analyze the indicators we have to prepare for the likely scenarios that will unfold.

[1] https://openai.com/blog/chatgpt-plugins

[2] https://www.businessinsider.com/openais-gpt-4-means-chatgpt-text-into-video-microsoft-cto-2023-3?amp