Vector databases are gaining popularity, with many startups diving into the field and investors showing keen interest. This surge is fueled by the rise of large language models (LLMs) and generative AI (GenAI) technology, which have paved the way for vector database innovations to thrive.
Traditional databases like Postgres or MySQL are good for organizing structured data, which is data that fits neatly into rows and columns with specific types like numbers or text. But they're not so great for handling unstructured data, like pictures, videos, emails, or social media posts, where there's no clear format or organization.
Vector databases are like smart organizers for data. Instead of just storing information in its original form, like words or pictures, they transform it into numbers that show what it means and how it's connected to other things. This makes it simpler for computers to understand and find similar information. It's like putting related things closer together on a shelf so you can find them easily.
This is really useful for smart AI chatbots like OpenAI's GPT-4. It helps them understand conversations better by checking out similar chats from before. And it's not just for chatbots! It's also super helpful for things like suggesting cool stuff to check out on social media or shopping apps. It can quickly find things similar to what you've been looking for.
Using vector search can also make LLM applications better by giving them more details, even if those details weren't in the training data.
"Even if you don't use vector similarity search, you can still create AI and machine learning apps. But without it, you'll have to spend more time training and adjusting the models. Vector databases are helpful when you have a big dataset and need a tool to manage vector embeddings quickly and easily."
In January, Qdrant got $28 million to grow even more. It's now one of the top 10 fastest-growing open source companies. But it's not alone. Other companies like Vespa, Weaviate, Pinecone, and Chroma also got a lot of money last year—$200 million in total—for their vector databases.
Recently, we've witnessed some exciting developments in the startup world. Index Ventures spearheaded a $9.5 million investment in Superlinked, a platform that simplifies tricky data into easy-to-understand vectors. Also, Y Combinator (YC) introduced its Winter '24 cohort, featuring Lantern, a startup offering a handy search engine for Postgres databases.
Elsewhere, Marqo got $4.4 million last year and then got another $12.5 million in February. Marqo has a platform with lots of vector tools that you can use right away. These tools help with making, storing, and finding vectors. With Marqo, you don't need other tools from companies like OpenAI or Hugging Face. Everything you need is in one place, and you can access it all using just one API.
Marqo's founders, Tom Hamer and Jesse N. Clark, used to work at Amazon, where they noticed a big problem: it was hard to search for things in different ways, like using words or pictures. So, in 2021, they left Amazon and started Marqo to solve this problem.
0 Comments