One Year of ChatGPT - Here’s What’s Coming Next

November 29, 2023

5 mins

Written by

Eric Landau

Before OpenAI was a producer of the most scintillating boardroom corporate drama outside of an episode of Succession, it was the creator of the universally known AI application ChatGPT. On the eve of the one-year anniversary of its launch (a whirlwind year of progress, innovations, and twists), it is worth revisiting the state of AI post-ChatGPT with a view towards looking forward.

A year ago, ChatGPT took the world by storm, smashing even OpenAI’s greatest expectations of adoption by becoming the world’s fastest-growing consumer app of all time. While the last year has been filled with a panoply of new models, hundreds of freshly minted startups, and gripping drama, it still very much feels like only the early days of the technology.

As the cofounder of an AI company, and having been steeped in the ecosystem for years, the difference this last year has made has been nothing short of remarkable—not just in technological progress or academic research (although the strides here have been dizzying) —but more in unlocking the public imagination and discourse around AI.

Y Combinator, a leading barometer of the direction of technological trends, has recently churned out batches where, for the first time, most companies are focused on AI. ChatGPT is now being used as a zinger in political debates. Once exotic terms like Retrieval-Augmented Generation are making their way into the vernacular of upper management in Fortune 500 companies. We have entered not a technological, but a societal change, where AI is palatable for mainstream digestion. So what’s next?

Seeing the future is easy, if you know where to look in the present: technological progress does not move as a uniform front where adoption of innovation propagates equally across all facets of society. Instead, it moves like waves crashing against the jagged rocks of a coastline, splashing chaotically forward, soaking some while leaving others dry. Observing where the water hits first lets you guess what happens when it splashes on others later. It takes one visit to San Francisco to notice the eerily empty vehicles traversing the city in a silent yet conspicuous manner to preview what the future looks like for municipalities around the world: a world with the elimination of Uber driver small talk.

While making firm predictions in a space arguably moving forward faster than any other technological movement in history is a fool’s game, clear themes are emerging that are worth paying attention to by looking at the water spots of those closest to the waves. We are only one year into this “new normal,” and the future will have much more to bring along the following:

Dive Into Complexity

One of the most exciting aspects of artificial intelligence as a technology is that it falls into a category few technologies do: “unbounded potential.” Moore’s Law in the ’60s gave a self-fulfilling prophecy of computational progress for Silicon Valley to follow. The steady march of development cycles has paved the way from room-sized machines with the power of a home calculator to all the marvellous wonders we take for granted in society today.

Similar to computation, there are no limits in principle for the cognitive power of computers across the full range of human capabilities. This can stoke the terrors of a world-conquering AGI, but it also brings up a key principle worth considering: ever-increasing intellectual power.

The AIs of today that are drawing boxes over cars and running segmentations over people will be considered crude antiquities in a few years. They are sub-component solutions used only as intermediate steps to tackle more advanced problems (such as diagnosing cancer, counting cars for parking tickets, etc). We must walk before we can run, but it is not difficult to imagine an ability to tackle harder and harder questions over time. In the future, AI will be able to handle problems of increasing complexity and nuance, ones that are currently limitations for existing systems.

While ChatGPT and other equivalent LLMs of today are conversant (and hallucinatory) in wide-ranging topics, they still cannot handle niche topics with reliability. Companies, however, have already begun tailoring these models with specialized datasets and techniques to handle more domain-specific use cases. With improved training and prompting, the emergence of AI professionals - such as doctors, paralegals, and claims adjusters - is on the horizon. We’re also approaching an era where these specialized applications, like a FashionGPT trained on the latest trends, can provide personalized advice and recommendations according to individual preferences.

We should expect a world where problems of a complexity and nuance that only particular domain experts can handle today will be well within the scope of AI capabilities. Topics like advanced pathology, negotiating geopolitical situations, and company building will be problems within AI capacity. If the history of computers is any beacon, complexity is the direction forward.

Multi-modality

Right now, there are categorical boxes classifying different types of problems that AI systems can solve. We have “computer vision”, “NLP”, “reinforcement learning”, etc. We also have separations between “Predictive” and “Generative AI” (with a corresponding hype cycle accompanying the rise of the term). These categories are useful, but they are mostly in place because models can, by and large, solve one type of problem at a time. Whenever the categorizations are functions of technological limitations, you should not expect permanence; you should expect redefinitions.

Humans are predictive and generative. You can ask me if a picture is of a cat or a dog, and I can give a pretty confident answer. But I can also draw a cat (albeit badly). Humans are also multi-modal. I can listen to the soundtrack of a movie and take in the sensory details of facial expressions, body language, and voice in both semantic content as well as tonal and volume variations. We are performing complex feats of sensor fusion across a spectrum of inputs, and we can perform rather complex inferences from these considerations. Given that we can do this adeptly, we shouldn’t expect any of these abilities to be outside the purview of sufficiently advanced models.

The first inklings of this multi-modal direction are already upon us. ChatGPT has opened up to vision and can impressively discuss input images. Open-source models like LLaVA now reason over both text and vision. CLIP combines text and vision into a unified embedding structure and can be integrated with various types of applications. Other multimodal embedding agents are also becoming commonplace.
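
To make the idea of a unified embedding structure more concrete, here is a minimal sketch of text-to-image search over a shared vector space. It does not call CLIP or any real model: the three-dimensional vectors, the file names, and the `search` helper are hand-written assumptions standing in for the embeddings a model like CLIP would produce. The only point is that once text and images live in the same space, "search" reduces to comparing vectors.

```python
import math

def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm

# Hypothetical, hand-written vectors standing in for image embeddings.
# A real system would obtain these from a multimodal model.
image_embeddings = {
    "cat.jpg": [0.9, 0.1, 0.0],
    "dog.jpg": [0.1, 0.9, 0.0],
    "car.jpg": [0.0, 0.1, 0.9],
}

def search(text_embedding, images):
    """Rank image names by similarity to the text embedding, best first."""
    return sorted(
        images,
        key=lambda name: cosine(text_embedding, images[name]),
        reverse=True,
    )

# Pretend this is the embedding of the text "a photo of a cat".
query = [0.8, 0.2, 0.1]
ranking = search(query, image_embeddings)
print(ranking)
```

With these made-up numbers the cat image ranks first, because its vector points in nearly the same direction as the query.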

Check out my webinar with Frederik Hvilshøj, Lead ML Engineer at Encord, on “How to build Semantic Visual Search with ChatGPT & CLIP”.

While these multimodal models haven’t found use in many practical applications yet, it is only a matter of time before they are integrated into commonplace workflows and products. Tied to the point above on complexity, multimodal models will start to replace their narrower counterparts to solve more sophisticated problems. Today's models can, by and large, see, hear, read, plan, or move, but mostly one at a time. The models of the future will do all of these simultaneously.

The Many Faces of Alignment

The future themes poised to gain prominence in AI not only encompass technological advancements but also their societal impacts. Among the onslaught of buzzy terms borne out of the conversations in San Francisco coffee shops, alignment has stood out among the rest as the catch-all for all the surrounding non-technical considerations of the broader implications of AI. According to ChatGPT:

AI alignment refers to the process and goal of ensuring that artificial intelligence (AI) systems' goals, decisions, and behaviors are in harmony with human values and intentions.

There are cascading conceptual circles of alignment dependent on the broadness of its application. As of now, the primary focus of laboratories and companies has been to align models to what is called a “loss function.” A loss function is a mathematical expression of how far away a model is from getting an answer “right.” At the end of the day, AI models are just very complicated functions, and all the surrounding infrastructure is a very powerful function optimization tool. A model behaving as it should, as of now, just means a function has been properly optimized to “have a low loss.”
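
As a rough illustration of what "optimizing a function to have a low loss" means, the sketch below fits a one-parameter model with plain gradient descent. Everything in it is an assumption chosen for the example: the toy data, the mean squared error loss, the learning rate, and the step count. Real models have billions of parameters and far more elaborate losses, but the loop has the same shape.

```python
# Toy illustration: fit y = w * x by minimizing mean squared error.
# The data, learning rate, and step count are made up for this sketch.

def mse_loss(w, xs, ys):
    """Mean squared error: how far the predictions w * x are from the targets."""
    return sum((w * x - y) ** 2 for x, y in zip(xs, ys)) / len(xs)

def mse_grad(w, xs, ys):
    """Derivative of the mean squared error with respect to w."""
    return sum(2 * (w * x - y) * x for x, y in zip(xs, ys)) / len(xs)

def fit(xs, ys, w=0.0, lr=0.05, steps=200):
    """Plain gradient descent: repeatedly nudge w against the gradient."""
    for _ in range(steps):
        w -= lr * mse_grad(w, xs, ys)
    return w

xs = [1.0, 2.0, 3.0]
ys = [2.0, 4.0, 6.0]  # generated by the "true" rule y = 2x

initial_loss = mse_loss(0.0, xs, ys)
w = fit(xs, ys)
final_loss = mse_loss(w, xs, ys)
print(round(w, 3), initial_loss, final_loss)
```

The optimizer only knows about the number returned by `mse_loss`. Whether a low value of that number matches what the researcher actually wanted is a separate question.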

This raises the question of how you choose the right loss function in the first place. Is the loss function itself aligned with the broader goal of the researcher building it? Then there is the question: if the researcher is getting what they want, does the institution the researcher is sitting in get what it wants? The incentives of a research team might not necessarily be aligned with those of the company. There is the question of how all of this is aligned with the interests of the broader public, and so on.

Dall-E’s interpretation of the main concentric circles of alignment

The clear direction here is that infrastructure for disentangling multilevel alignment seems inevitable (and necessary). Research in “superalignment” by institutions such as OpenAI, before their board debacle, is getting heavy focus in the community. It will likely lead to tools and best practices to help calibrate AI to human intention even as AI becomes increasingly powerful.

At the coarse-grained societal level, alignment takes the form of broad regulation imposed by politicians who need help finding the Google toolbar. Broad-brush regulations, similar to what we see in the EU AI Act, are very likely to follow worldwide. Tech companies will get better at aligning models to their loss, researchers and alignment advocates at aligning a loss to human goals, and regulators at aligning the technology to the law. Regulation, self-regulation, and corrective mechanisms are bound to come; their effectiveness is still uncertain.

The AI Internet

A question in VC meetings all around the world is whether a small number of powerful foundation models will end up controlling all intelligence operations in the future or whether there will be a proliferation of smaller fine-tuned models floating around unmoored from centralized control. My guess is the answer is both.

Clearly, centralized foundation models perform quite well on generalized questions and use cases, but it will be difficult for foundation model providers to get access to proprietary datasets housed in companies and institutions to solve finer-grained, domain-specific problems. Larger models are also constrained by their size and much more difficult to embed in edge devices for common workflows. For these issues, corporations will likely use alternatives to control their own fine-tuned models. Rather than having one model control everything, the future is likely to have many more AI models than today.

The proliferation of AI models to come harkens back to the early proliferation of personal computing devices. The rise of the internet over the last 30 years has taught us a key lesson: things like to be connected. Intelligent models/agents will be no exception to this.

AI agents, another buzz term on the rise, are according to ChatGPT:

Systems or entities that act autonomously in an environment to achieve specific goals or perform certain tasks.

We are seeing an uptake now on AI agents powered by various models tasked with specific responsibilities. Perhaps this will come down even to the individual level, where each person has their own personal AI completing the routine monotonous tasks for them on a daily basis. Whether this occurs or not, it is only a matter of time before these agents start to connect and communicate with each other. My scheduling assistant AI will need to talk to your scheduling assistant. AI will be social!

My guess is that a type of AI communication protocol will emerge, one in which daisy-chaining models of different skills and occupations will exponentiate their individual usefulness. These communication protocols are still some ways from being established or formalized, but if the days of regular old computation mean much, they will not be far away. We are seeing the first GitHub repos showcasing orchestration systems of various models. While still crude, if you squint, you can see a world where this type of “AI internet” integrates into systems and workflows worldwide for everyday users.
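
To give a feel for what two scheduling assistants talking to each other might look like, here is a deliberately tiny sketch. No such protocol is established: the `SchedulingAgent` class, the message fields, and the slot labels are all hypothetical, and a real agent would sit on top of a model and a calendar rather than a hard-coded set of strings.

```python
from dataclasses import dataclass, field

@dataclass
class SchedulingAgent:
    """Hypothetical assistant that knows its owner's free time slots."""
    owner: str
    free_slots: set = field(default_factory=set)

    def propose(self):
        """Message 1: share the slots this agent's owner has free."""
        return {"from": self.owner, "type": "proposal",
                "slots": sorted(self.free_slots)}

    def respond(self, message):
        """Message 2: accept the first shared slot in sorted order, or decline."""
        shared = sorted(self.free_slots & set(message["slots"]))
        if shared:
            return {"from": self.owner, "type": "accept", "slot": shared[0]}
        return {"from": self.owner, "type": "decline", "slot": None}

mine = SchedulingAgent("me", {"Mon 10:00", "Tue 14:00", "Wed 09:00"})
yours = SchedulingAgent("you", {"Tue 14:00", "Wed 09:00", "Fri 16:00"})

reply = yours.respond(mine.propose())
print(reply)

# An agent with no overlapping availability declines instead.
busy = SchedulingAgent("busy", {"Sat 08:00"})
declined = busy.respond(mine.propose())
print(declined)
```

The interesting part is not the logic, which is trivial here, but the agreed message shape: once both sides understand "proposal" and "accept", agents built by different people can be chained together.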

Paywalling

The early internet provided a cornucopia of free content and usage powered by VC largesse with the mandate of growth at all costs. It took a few years before the paywalls started, in news sites around the world, in walled-off premium features, and in jacked-up Uber rates. After proving the viability of a technology, the next logical step tends to be monetization.

For AI, the days of open papers, datasets, and sharing in communities are numbered as the profit engine picks up. We have already seen this in the increasingly, almost comically, vague descriptions OpenAI releases about their models. By the time GPT-5 rolls around, the expected release won’t be much less guarded than OpenAI just admitting, “we used GPUs for this.” Even non-tech companies are realising that the data they possess has tremendous value and will be much more savvy before letting it loose.

AI is still only a small portion of the economy at the moment, but its generality and unbounded potential stated above lead to the expectation that it can have absolutely enormous economic impact. Ironically, the value created by the early openness of technology will result in the end of technological sharing and a more closed mentality.

The last generation of tech growth has been fueled by social media and “attention.” Any barriers to engagement, such as putting a credit card upfront, were discouraged, and the expectation that “everything is free” became commonplace in using many internet services. OpenAI, in contrast, rather than starting with a traditional ad-based approach for monetization, opened up a premium subscription service and is now charging hefty sums for tailored models for corporations. The value of AI technology in its own right obviates the middle step of funding through advertising. Data and intelligence will likely not come for free.

As we shift from an attention economy to an intelligence economy, where automation becomes a core driver of growth, expect the credit cards to start coming out.

Dall-E’s interpretation of the coming AI paywall paving the transition from an attention economy to an intelligence economy

Expect the Unexpected

As a usual mealy-mouthed hedge in any predictive article, the requisite disclaimer of the unimaginable items must be established. In this case, this is also a genuine belief. Even natural extrapolations of AI technology moving forward can leave us in heady disbelief of possible future states. Even much smaller questions, like whether OpenAI itself will still exist in a year, are extremely difficult to predict.

If you asked someone 50 years ago about capturing some of the most magnificent imagery in the world, of items big or small, wonders of the world captured within a device in the palm of your hand and served in an endless scroll among other wonders, it would seem possible and yet inconceivable. Now, we are bored by seeing some of the world's most magnificent, spectacular images and events. Our demand for stimulating content is being overtaken by supply. Analogously, with AI, we might be in a world where scientific progress is accelerated beyond our wildest dreams, where we have more answers than questions, and where we cannot even process the set of answers available to us.

Using AI, deep mathematical puzzles like the Riemann Hypothesis may be laid bare as a trivial exercise. Yet, the formulation of interesting questions might be bottlenecked by our own ability and appetite to answer them. A machine to push forward mathematical progress beyond our dreams might seem too much to imagine, but it’s only one of many surreal potential futures.

If you let yourself daydream of infinite personal assistants, where you have movies of arbitrary storylines created on the fly for individual consumption, where you can have long and insightful conversations with a cast of AI friends, where most manual and cognitive work of the day has completely transformed, you start to realize that it will be difficult to precisely chart out where AI is going.

There are of course both utopian and dystopian branches of these possibilities. The technology is agnostic to moral consequence; it is only the people using it and the responsibility they incur that can be considered in these calculations. The only thing to expect is that we won’t expect what’s coming.

Conclusion

Is ChatGPT to AI what the iPhone was to the app wave of the early 2010s? Possibly, and that is probably why OpenAI ran a very Apple-like keynote before Sam Altman’s shocking dismissal and return. But what is clear is that once items have permeated into public consciousness, they cannot be revoked. People understand the potential now. Just 3 years ago, an AI company struggling to raise a seed round had to compete for attention against crypto companies, payments processors, and fitness software. AI companies today are a hot ticket item and have huge expectations baked into this potential.

It was only 9 months ago that I wrote about “bridging the gap” to production AI. Amidst all the frenzy around AI, it is easy to forget that most models today are still only in the “POC” (Proof of Concept) state, not having proved sufficient value to be integrated with real-world applications.

ChatGPT really showed us a world beyond just production, to “post-production” AI, where AI's broader societal interactions and implications become more of the story than the technological components that it’s made of. We are now at the dawn of the “Post-Production” era.

Where this will go exactly is of course impossible to say. But if you look at the past, and at the present, the themes to watch for are: complexity, multi-modality, connectivity, alignment, commercialization, and surprise. I am certainly ready to be surprised.

Previous blog

Product Updates [September 2023]

Next blog

Logistic Regression: Definition, Use Cases, Implementation

Related blogs

View all

sampleImage_explained-new-ai-executive-order-open-vs-closed-source

Learn

Understanding the United States Executive Order on Safe, Secure, and Trustworthy AI

On October 30, 2023, the White House announced an Executive Order issued by President Joe Biden aimed at fostering a balanced approach toward the development and deployment of Artificial Intelligence (AI) to ensure it's safe, secure, and trustworthy. It acknowledges the potential of AI technologies in solving urgent societal challenges and enhancing prosperity, productivity, innovation, and security. However, the Executive Order highlights the potential adverse effects that an irresponsible use of artificial intelligence could have, such as fraud, discrimination, bias, misinformation, threats to national security, and the need for guardrails. The Order calls for a collective effort from the federal government (including the Department of Homeland Security, the Department of Health and Human Services, the Department of Energy, the Department of Commerce, and more), the private sector, academia, and civil society to mitigate these harms while maximizing the benefits of AI. Here are the three main guiding principles behind this Executive Order: Safety and security: The Order emphasizes the need for robust, reliable, repeatable, and standardized evaluations of AI systems. It mandates addressing security risks, including those related to biotechnology, cybersecurity, and critical infrastructure. The document also highlights the importance of testing, post-deployment monitoring, and effective labeling to ensure that AI systems are ethically developed, securely operated, and compliant with federal laws. Responsible innovation: It encourages promoting responsible innovation, competition, and collaboration to maintain U.S. leadership in AI. The Order calls for investments in AI-related education, training, development, research, and tackling intellectual property issues. 
It also emphasizes creating a fair, open, and competitive AI ecosystem and marketplace, supporting small developers, and addressing potential risks from dominant firms' control over critical assets like semiconductors, computing power, cloud storage, and data. Supporting American workers: As AI creates new jobs and industries, the Order stresses adapting job training and education to support a diverse workforce. It advises against deploying AI in ways that undermine rights, worsen job quality, or cause harmful labor-force disruptions. The Order encourages building the next steps in AI development based on the views of workers, labor unions, educators, and employers to support responsible AI uses that improve workers' lives and augment human work. In subsequent sections of this article, we will examine the actions among the AI directives in this Executive Order. In the meantime, let’s explore how we got here. How did we get here? The History of AI Regulation in the United States of America President Biden's Executive Order on Safe, Secure, and Trustworthy Artificial Intelligence is the result of years of developing insights and responses to emerging technologies in the field of AI. In order to show how we came to this important turning point, this section will walk you through the path of AI regulation in the United States. Early Engagement, Regulating Open- and Closed-Source LLMs Navigating the spectrum between open and closed LLM systems is critical for effective AI policy. Striking the right balance will promote innovation and competition while managing the potential risks of AI. By 2024, the National Institute of Standards and Technology (NIST) under the U.S. Department of Commerce will determine whether they will allow the release of open model weights under public licenses. This, of course, is bound to stir up discussions surrounding treating open model weights as free speech and accusations of lobbying from big tech companies to protect their MOAT. 
As these LLM systems began permeating various sectors, the need for a regulatory framework became apparent. Policymakers grappling with the rapid advancements in AI models and tools started the conversation about balancing promoting US global leadership in AI with the risks to individuals, businesses, and national security. Legislative Efforts The early engagement translated into legislative action, with the USA’s House and Senate committees holding numerous hearings on AI. The hearings included big names like Elon Musk, CEO of SpaceX, Tesla, and X, formerly known as Twitter; Mark Zuckerberg, CEO of Meta; former Microsoft co-founder Bill Gates; and Sam Altman, CEO of OpenAI, the parent company of AI chatbot, ChatGPT. Biden Administration’s Early Steps In October 2022, the Biden administration issued a non-binding AI Bill of Rights, marking an early step towards delineating the government’s stance on governing automated systems, focusing on civil rights protection. Soon after, on September 12, several tech companies signed voluntary agreements to follow the rules President Biden set out for AI. This was the first step toward encouraging responsible AI use through partnerships with the private sector. SAFE Innovation—A Values-Based Framework and New Legislative Process Despite strong bipartisan interest, the challenge of passing comprehensive AI legislation continued, paving the way for the SAFE Innovation Framework proposal by Senate Majority Leader Chuck Schumer. The Executive Order The culmination of these efforts and the evolving understanding of AI's impact led to the issuance of the Executive Order on Safe, Secure, and Trustworthy Artificial Intelligence. This Executive Order embodies a more structured approach to AI governance, reflecting the administration’s commitment to promoting responsible AI development and deployment while addressing the associated potential risks of AI. What are the Executive Order Directives? 
We have summarized the Executive Order Directives below so you can easily skim through and find the directives and the corresponding actions relevant to you. Directive 1: New Standards for AI Safety and Security Actions: Require developers to share safety test results with the U.S. government. Develop standards and tools to ensure AI systems are safe and secure. Protect against AI-enabled risks to national security and public health. Establish strong standards for biological synthesis screening. Directive 2: Protecting Americans’ Privacy Actions: Prioritize federal support for privacy-preserving techniques in AI. Strengthen privacy-preserving research and technologies. Evaluate how agencies collect and use commercially available data. Develop guidelines for federal agencies to evaluate privacy-preserving techniques. Directive 3: Advancing Equity and Civil Rights Actions: Offer advice to stop AI programs from making discrimination worse. Address algorithmic discrimination through training and coordination. Ensure fairness in the criminal justice system's use of AI. Directive 4: Standing Up for Consumers, Patients, and Students Actions: Make advances in the responsible use of AI in healthcare. Shape AI’s potential in education. Protect consumers and patients while ensuring AI benefits. Directive 5: Promoting Innovation and Competition Actions: Catalyze AI research and provide grants in vital areas. Promote a fair and competitive AI ecosystem. Streamline visa criteria for skilled immigrants. Directive 6: Supporting Workers Actions: Develop principles and best practices for worker protection. Produce a report on AI’s labor-market impacts. Directive 7: Advancing American Leadership Abroad Actions: Expand collaborations on AI at bilateral, multilateral, and multistakeholder levels. Accelerate the development of AI standards with international partners. Promote responsible AI development abroad. 
Directive 8: Ensuring Responsible and Effective Government Use of AI Actions: Issue guidance for agencies’ AI use. Streamline AI product and service acquisition. Accelerate the hiring of AI professionals in government. Now that we've discussed the key directives of the US Executive Order on AI, let's compare and contrast them with the European Union's approach to AI regulation, known as the EU Artificial Intelligence Act (AI Act). US Executive Order on Safe, Secure, and Trustworthy AI vs European Union AI Act In the table below, we present a comparative overview of the key aspects and focus areas of the US Executive Order on Safe, Secure, and Trustworthy AI and the EU Artificial Intelligence Act (AI Act). Read more about the takes on “Proposed AI Regulation: EU AI Act, UK's Pro-Innovation, US AI Bill of Rights” from Encord’s co-founder and president. As you saw in the comparison, while both regulations aim to foster a safe and responsible AI ecosystem, they approach AI governance from slightly different vantage points, reflecting the distinct priorities and regulatory philosophies of the US and the EU. What does the European AI Act mean for you, an AI developer? Learn more from this article by Ulrik Stig Hansen, Encord’s co-founder and president. Conclusion Increased involvement from policymakers, legislative efforts, and joint initiatives between the public and private sectors have all contributed to the current AI regulatory landscape. The issuance of the Executive Order represents a significant milestone in the ongoing journey towards establishing a robust framework for AI governance in the U.S. aimed at harnessing the benefits of AI while mitigating its potential perils. But will regulations stifle the efforts of open-source AI? Or would it encourage an ecosystem of open innovation while regulating the risks at the application layer? 
In this article, you learned about the evolution of AI regulation in the U.S., focusing on key legislative efforts, the Biden Administration's early steps towards AI governance, and the collaborative initiatives that marked the journey towards the recent Executive Order. We talked about how AI was regulated, which led to the Executive Order on Safe, Secure, and Trustworthy Artificial Intelligence. These included actions taken by lawmakers, tech companies making voluntary commitments, and the release of frameworks based on values like the SAFE Innovation Framework. Finally, we compared different aspects of the directives to the proposed European Union AI Act, where you saw clearly different priorities and regulatory philosophies between the United States Congress and the European Parliament. Get access to our new AI Act Learning Pack, which includes all the key resources you need to ensure forward compatibility.

Nov 01 2023

5 M

sampleImage_why-ai-is-the-mother-of-all-unicorns

Company

Why AI Is the Mother of All Unicorns

"Say you want to watch a movie. To choose, you'll want to know what movies others liked and, based on what you thought of other movies you've seen if this is a movie you'd like. You'll be able to browse that information. Then you select and get video on demand. Afterward, you can even share what you thought of the movie. But thinking of it only in terms of movies on demand trivializes the ultimate impact. The way we find information and make decisions will be changed. Think about how you find people with common interests, pick a doctor, and decide what book to read. Right now, reaching out to a broad range of people is hard. You are tied into the physical community near you. But in the new environment, because of how information is stored and accessed, that community will expand. This tool will be empowering, the infrastructure will be built quickly and the impact will be broad." - The Bill Gates Interview, Playboy Magazine, July 1994 Sound familiar? Asked what else the personal computer was supposed to do other than process documents, Bill Gates prophesied the changes brought about by the coming of the information age that modern-day tech giants have since realized. From video-on-demand and movie recommendations (Netflix) to the way we find information (Google) to how you find people with common interests (Facebook) and deciding what book to read (Amazon 1.0), Gates' vision of the transformation that the information age would bring about turned out in more ways than one could conceivably imagine at the dawn of the Internet revolution. The Coming of the AI Revolution Fast forward 20 years, the AI revolution has begun. It will fundamentally transform our world, much just like the advent of the atomic bomb, microprocessor, personal computer, and the Internet. If the wealth generated from the emergence of each of these technologies offers any indication, we are poised to witness an unprecedented accumulation of wealth. 
As with any prophesied significant platform shift, there's a real risk that they fail to materialize in a big way at a particular moment in time (e.g. Web3, Blockchain, Crypto, Metaverse) or that they take much longer than anticipated to play out (admittedly, crypto can still find an actual use case). Until as recently as ten years ago, almost all AI systems failed to demonstrate significant value, and many still do not (e.g., purely logic-based AI systems and symbolic AI - the dominant paradigm from the 1950s to the mid-1990s - are still primarily research interests). We could be in for another AI hype cycle that may eventually fizzle. As an eternal optimist and founder of an AI company looking to raise a Series B in the not-too-distant future, I won't bother spelling out why AI is overrated. Instead, I'll argue why it will change the world. When I explain AI to my parents, I describe it as a new form of dynamic software built on answers, unlike traditional status software built on rules. Put simply, comparing AI to conventional software is like saying "show" instead of "tell." What's exciting about AI is that dynamic and answer-based software will enable us to create new products, applications, and systems that can solve unsolved problems that, until now, have been reserved for human cognition. Self-driving cars are the most obvious example - while traditional software can handle simple tasks like driving straight, building a fully autonomous vehicle would require an overwhelming number of static rules to cover even the basics of navigation. It is not a leap to believe that the total addressable market (TAM) of problems only solvable by human cognition is orders of magnitude higher than that of any of the problems for which we use traditional software. As AI can augment and - in some cases - replace humans, it can produce what I think of as "non-linear" productivity outcomes. 
Here are a few contrived examples across various vertical use cases to illustrate the potential non-linearity of AI systems:

Building a faster car to reduce the amount of attention required to drive from A to B (linear) vs. a self-driving vehicle (non-linear)

More efficient organization of leads and tasks in a CRM system with a slightly better UI for salespeople (linear) vs. AI talking avatars that allow for infinite scaling of the salesperson (non-linear)

Improved diagnostic equipment that provides more detailed images for radiologists to analyze (linear) vs. AI-driven systems that scan medical images and highlight potential anomalies for doctors, or even predict possible illnesses from health data before symptoms manifest (non-linear)

Better tractors and machinery to help farmers plant and harvest crops (linear) vs. drones and robots that monitor the health of individual plants, apply precise amounts of fertilizer or pesticide, and harvest crops with minimal human intervention (non-linear)

A digital learning and education platform with improved video lectures and homework targeted specifically at programming (linear) vs. an adaptive chatbot that can be prompted to "explain this concept like I'm 12 years old" (non-linear)

The digital learning example is interesting because it is playing out in real time: Chegg, the education technology company, saw its stock price tumble 47% (down ~63% year-to-date) after admitting that ChatGPT was pressuring its subscriber growth, leading it to suspend its full-year outlook.

You get the idea. Just as the Internet's value skyrocketed with evolving applications, tools, and increased user participation, so too will the AI sector's worth. Although the Internet's basic components remain similar to those of the early 1990s, its value has grown exponentially over 20 years due to expanded applications and user engagement.
As more individuals and businesses embrace AI and develop applications, the supporting tools and infrastructure will improve. Increased data availability will also enhance product quality. This cyclical improvement will fuel exponential growth in AI.

Undoubtedly, AI is poised to be the next major technological platform shift within the next 20 years. While previous technological revolutions, like the Internet or the Industrial Revolution, were monumental in reshaping societies and economies, AI encapsulates something far more profound: the essence of human cognition. The wealth generated by novel solutions to previously unsolvable problems, combined with heightened productivity and efficiency across all sectors, implies that the economic impact of AI could dwarf that of all prior technological shifts. The emergence of AI will give birth to an unfathomable number of unicorns.

Who Wins: Titans, Challengers, or Innovators?

OK, so AI will be huge, but who wins the biggest slice of the pie, and where will the most value be generated? While the Twitter VC community may have its own predictions, here are my thoughts on a potential outcome. I could of course be entirely wrong.

Early winners like NVIDIA have already experienced a surge in their stock price, and investors believe that generative AI and LLM developers will be the next big thing, as evidenced by the high valuations and significant investment flowing into those companies. The landscape is already fiercely competitive, especially among "neo" foundation model/LLM providers (e.g., Cohere, Anthropic, Mistral). Given the high valuations and evolving competitive landscape, I question the viability of venture-scale returns for most of these new entrants. There is a limit to how many chatbots the market can absorb, after all.
Additionally, OpenAI is so far ahead (8 years of R&D and billions of queries via ChatGPT generating valuable RLHF data) that it will be difficult for any of these companies to catch up. Perhaps one or two will succeed, but for emerging LLM companies to truly thrive, they will likely need to uncover unique niches, pivot towards refining larger models using specialized, proprietary datasets for distinct needs, and/or partner with downstream application developers. Some corporate/venture combinations could also happen, where OpenAI/MSFT-, Anthropic/Google-, and Cohere/Meta-type partnerships combine distribution and data advantages with R&D expertise. Elad Gil has made some interesting observations on this.

Separately, foundation model providers will likely realize lower margins in the first few years of operation, as they more closely resemble "hardware-type" companies with significant upfront training expenses. However, this does not mean that developers who build applications on these large models won't achieve considerable margins, even if the TAM of those markets is much smaller, for example by offering specialized expertise, products, and other value-added services related to these models. Jasper is an example of a company that has done this perfectly - they've built a product that serves marketers, and just marketers, well.

All things considered, in 20 years the AI market will probably resemble the current software market. There will be a few huge "Big AI" companies with over $100 billion in revenue (this could very well end up being the foundation model developers, but it could also be companies that we haven't even conceived of yet) and a diaspora of large companies focusing on specific applications (e.g., Stripe for payments, Uber for transportation, Figma for design - this could be Jasper for marketers, Cruise for autonomous vehicles, Viz AI for medical imagery, and so on).
For context, Apple, Microsoft, Amazon, Meta, and Alphabet constitute ~$9 trillion of the NASDAQ's ~$22 trillion market cap. This is a substantial chunk, no doubt, but the total size of the pie is only going to get bigger as the AI market gets underway and secular trends in technology continue to reverberate.

Why This Time is Different

Technological revolutions often occur due to a convergence of pivotal factors, and the AI sector is currently experiencing such a juncture. Similar to the Internet's ascendance, which was enabled by ubiquitous personal computers and faster connectivity, the current AI boom is a product of simultaneous advances in computing power, vast data availability, and increasingly advanced models.

Eric and I founded Encord at a pivotal moment, when object detection models transitioned from being frequently wrong and requiring highly controlled "sandbox"-type environments to delivering tangible ROI. Similarly, the release of ChatGPT marked a paradigm shift in how we approach natural language processing and understanding. Looking into the near future, I anticipate AI delving deeper into multi-modal applications, offering higher ROI and increasingly viable solutions to more complex problems, and even stepping into realms of human reasoning.

In short, the market is just getting started. The value, revenue, TAM, etc., will naturally accrue as the complexity of the problems that we solve with AI increases. After all, it was impossible to stream a movie over your Internet connection 20 years ago, but now it's table stakes.

Aug 24 2023

4 mins

Video

Fireside Chat: AI for Augmented Reality in 2023 and Beyond

In this edition of Encord’s Fireside Chats, Victor Prisacariu, of the University of Oxford and Niantic, sits down with Eric Landau, Encord’s CEO and Co-Founder, to discuss recent developments in AI, computer vision, and machine learning. Victor’s current focus is real-time Augmented Reality on mobile and wearable platforms; he co-founded 6D.ai, which Niantic acquired in March 2020. Drawing on his wide and varied experience of the industry, Victor touched on crucial areas of AR and discussed his work at Niantic in depth.

May 25 2023

3 mins

Dive Into Complexity

Multi-modality

The Many Faces of Alignment

The AI Internet

Paywalling

Expect the Unexpected

Conclusion

Related blogs

Understanding the United States Executive Order on Safe, Secure, and Trustworthy AI

Why AI Is the Mother of All Unicorns

Fireside Chat: AI for Augmented Reality in 2023 and Beyond

Meta’s Llama 3.1 Explained

Top 10 Multimodal Models

Introducing TTI-Eval: An Open-Source Library for Evaluating Text-to-Image Embedding Models

AI as a Service: The Ultimate AIaaS Guide for Business in 2024

Intelligent Process Automation Vs. Robotic Process Automation: Key Differences

Llama 3V: Multimodal Model 100x Smaller than GPT-4

GPT-4o vs. Gemini 1.5 Pro vs. Claude 3 Opus: Multimodal AI Model Comparison

Meta Imagine AI Just got an Impressive GIF Update

Knowledge Distillation: A Guide to Distilling Knowledge in a Neural Network

What is Continuous Validation?

Best Practices for Handling Unstructured Data Efficiently

Ray-Ban Meta Smart Glasses are Getting an Upgrade with Multimodal AI

Phi-3: Microsoft’s Mini Language Model is Capable of Running on Your Phone

DataOps Vs MLOps: What's the Difference?

Overfitting in Machine Learning: How to Detect and Avoid Overfitting in Computer Vision?

Top 8 Alternatives to the Open AI CLIP Model

Meta AI’s Llama 3: The Most Awaited Intelligent AI-Assistant

MM1: Apple’s Multimodal Large Language Models (MLLMs)

Diffusion Transformer (DiT) Models: A Beginner’s Guide

Google’s Video Gaming Companion: Scalable Instructable Multiworld Agent [SIMA]

What is Robotic Process Automation (RPA)?

YOLO World Zero-shot Object Detection Model Explained

Top 9 Tools for Generative AI Model Validation in Computer Vision

Mistral Large Explained

An Overview of the Machine Learning Lifecycle

YOLOv9: SOTA Object Detection Model Explained

Introduction to Krippendorff's Alpha: Inter-Annotator Data Reliability Metric in ML

Model Drift: Best Practices to Improve ML Model Performance

AI in 2023: A Retrospective

Logistic Regression: Definition, Use Cases, Implementation

What is Ensemble Learning?

Accuracy vs. Precision vs. Recall in Machine Learning: What is the Difference?

Data Clustering: Intro, Methods, Applications

Mastering Supervised Learning: A Comprehensive Guide

MiniGPT-v2 Explained

Top Multimodal Annotation Tools

GPT-4 Vision vs LLaVA

Zero-Shot Learning (ZSL) Explained

Mistral 7B: Mistral AI's Open Source Model

Activation Functions in Neural Networks: With 15 examples

Meta-Transformer: Framework for Multimodal Learning

Training, Validation, Test Split for Machine Learning Datasets

Meta Training Inference Accelerator (MTIA) Explained

The Full Guide to Embeddings in Machine Learning

Human-in-the-Loop Machine Learning (HITL) Explained

The Step-by-Step Guide to Getting Your AI Models Through FDA Approval

Webinar: Are Visual Foundation Models (VFMs) on par with SOTA?