MELTing Point: Mobile Evaluation of Language Transformers
We explore the feasibility of deploying LLMs on device, a model in which user prompts and LLM outputs never leave the device premises.
Read this article →In August 2023, Brave released Leo—Brave’s native AI assistant—on its Nightly channel. Since then we’ve incorporated a lot of user feedback, and made numerous improvements to the feature. There are more changes on the way, and soon we will launch Leo on our full release channel, making it available to our 60 million users.
We thought now would be a great time to share more about how Brave is thinking about Leo, and the future AI integrations within the browser.
Brave has always been a user agent. A user agent is a client application that gives users better control of the content they access on the Internet, without conflicts of interest between server operators – including third parties – and the user. Leo takes this concept one step further, moving beyond the limited stance of browsing content in the way the author or creator intended. With Leo, you’re not passively consuming web pages and videos, rather you’re actively exploring content: asking tangential questions, seeking clarifications, exploring diverse viewpoints, and overall tailoring the Web to your preferences.
Leo “understands” the current page you’re visiting—it’s the next best thing to talking directly with the author. In fact, in many ways it’s better, as Leo can independently analyze and reason about pages, free of the author’s or creator’s biases.
To be clear, Brave isn’t trying to create the world’s leading Large Language Model (LLM) from scratch. Instead, our goal is to link users with top LLMs and harness them in innovative and where feasible, privacy preserving ways. Our goal is to open novel, convenient, and private use cases that are often possible only in the context of a particular user’s browsing session.
As with all Brave products, Leo adheres to Brave’s core values and is built with privacy by design. User inputs are always submitted anonymously through a reverse-proxy to our inference infrastructure, giving users an AI experience with unparalleled privacy. Also, conversations are discarded immediately after the reply is generated and are not persisted on Brave’s servers. As we expand Leo’s development, we will continue to integrate privacy into its features and to innovate with new approaches to give our users the privacy they expect.
The first step to opening this amazing feature to our general release channel is to make sure the cost is viable for Brave to offer to its full user base. With more than 60 million users worldwide, the cost of running Leo as-is on our general release channel would be prohibitive without a paid option. For this reason, we’ll soon be offering a premium version of Leo. Alongside the free version of this feature, the premium version will cover the costs of hosting our premium models, and for paying the API access for other premium model choices.
This next update to Leo will include:
Here’s a table that summarizes the free / premium options:
Feature | Free Leo | Leo premium |
---|---|---|
Models | Llama 2 13B Code Llama 13B |
Llama 2 13B Llama 2 70B Code Llama 13B Code Llama 70B Anthropic Claude Instant |
Rate limits | Reasonable rate limits | Higher rate limits |
Quality of conversations | Limited by models | Very high |
Privacy | Inputs are always submitted anonymously through a reverse-proxy and are not retained. | Inputs are always submitted anonymously through a reverse-proxy and are not retained. |
We’re targeting mid to late Q4 for the Leo release.
Mobile parity will most likely follow shortly after the desktop release.
The focus of the mobile release will be to make sure everything from the desktop release works on Android and iOS.
Major enhancements and features for Leo are described and grouped into categories below. Some of these features will be available with the first release, and others will roll out during each of our browser updates.
Today, if Leo is prompted in another language, it can already answer in that language (such as French and Spanish). However, not all languages are covered, and a page’s context, when used, will likely determine the response language. We’d like to add better handling for languages, and overall to expand the number of languages, and the quality of the output of those languages, by adding better international models.
With Leo by their side, users can interact with the Web in groundbreaking new ways. The future of Leo outlined in this roadmap is exciting to us. Unlike other AI assistants, Leo is not a simple chat interface. Leo is a companion for helping you make sense of the Web as you browse, while preserving your privacy and helping you stay in charge of your experience.
We genuinely value insights from our community. For any thoughts or feedback, connect with us on our community question and answer site.
We explore the feasibility of deploying LLMs on device, a model in which user prompts and LLM outputs never leave the device premises.
Read this article →Leo—Brave's in-browser AI assistant—is now even more useful thanks to its integration with Brave Search.
Read this article →Answer with AI is the only large real-time answer engine that puts privacy first and does not rely on Big Tech search engines
Read this article →