

Unstructured data will account for 80%






Here is an EASY thread

$MDB MongoDB was initially released in 2009
and NoSQL quickly became a good alternative / complement to relational databases 
Certainly for the modern web giants
that had to deal with increasing amounts of data and real-time web applications


Certainly for the modern web giants


OK… That isn’t very telling
Let’s dive into databases
What are relational databases
What are non-relational databases 
What are the use cases for relational databases 
What are the use cases for non-relational databases












When a system or developer needs information


Storing data on different tables enables for more flexibility
and less redundancy 
E.g. A shop has information on its customers
(Customer ID, name, state, mailing address) and their purchases
(Customer ID, purchases, time of visit)


E.g. A shop has information on its customers











An alternative would have been to create a huge database
with Customer ID, name, state, mailing address, purchases, time of visit
But that would NOT be EFFICIENT


A key aspect of relational databases is that data needs to be saved according to a pre-determined model 
These make sure that relational databases are consistent
Meaning that readers ALWAYS receive the last updated information
or an error








Careful UPFRONT design
is thus required and changes have to be carefully implemented
thereafter (sometimes including downtime)
This is a problem for data where the relationships change often or when the data lacks structure




Another key weakness is SCALING
When the amounts / complexity of the data you handle changes
What can you do?
⇣ VERTICAL SCALING
Add more power (CPU, RAM)
to make your machines faster
⇢ HORIZONTAL SCALING
Add more machines 


When the amounts / complexity of the data you handle changes

⇣ VERTICAL SCALING


⇢ HORIZONTAL SCALING




⇣ VERTICAL SCALING in this case, the data resides in the same node (or database instance - place where the data is saved and can be handled)
Costs rise exponentially when adding CPUs and RAM
to ONE machine as complexity rises



E.g. buying one high-performance custom made computer is an order of magnitude more expensive than buying 10 standard and average performance computer
⇢ HORIZONTAL SCALING in this case, the data is distributed over several machines 


Costs are kept under control as more “simple” machines can be used as one machine doesn’t need to execute all of the work









Hint






These documents (equivalent of the “rows” in relational databases”) can be arranged in “collections” (equivalent of the “tables” in relational databases) inside the database
But… Since we can compare documents to rows and collections to tables… isn’t it the same thing?


Well no, the major difference lies in the fact that different documents DO NOT NEED TO SHARE THE SAME STRUCTURE 
This is key difference from relational databases
where every row has the same fields

















Data that is unstructured

Lots of data


Does it also provide HORIZONTAL SCALABILITY



YES 
While you cannot in principle split tables (from relational databases) on different nodes
You CAN DISTRIBUTE documents (from non-relational databases) over several machines 


This enables for horizontal scaling

While you cannot in principle split tables (from relational databases) on different nodes









But what about these duplicates? From IBM 
“That means the information you receive from a query may be incorrect by a few seconds—perhaps up to half a minute. On social media sites, this means seeing an old profile picture when the newest one is only a few moments old...

“That means the information you receive from a query may be incorrect by a few seconds—perhaps up to half a minute. On social media sites, this means seeing an old profile picture when the newest one is only a few moments old...
...The alternative could be a timeout or error. On the other hand, in banking and financial transactions, an error and resubmit may be better than old, incorrect information.”
You can see it here
Non-relational databases are good for some websites
But others cannot work with “false information” 
This is why NoSQL is now said to stand for “Not Only SQL”
Where SQL refers to Structured Query Language (language for relational databases)








All in all, there are some use cases where relational databases
are a better fit 
And some other use cases where non relational databases work best




Relational databases are best used for tasks that fail as whole OR succeed - This can be said of





As you can imagine, a financial transaction needs to succeed or fail







These all have to deal with vast amounts of (un)structured content that often changes such as messages, pictures, comments, live data AND can fail / return an error temporarily


According to http://Scalegrid.io (provides services for non and relational databases)




As we can see
$MDB is one of the most popular NoSQL databases 
Is the market served by these databases growing 
In other words, is the market for UNSTRUCTURED
and LARGE AMOUNTS
of data growing?







Well, according the IDC the size of worldwide data will reach 175 ZettaBytes by 2025 
Representing a CAGR of 61% over the 2019 - 2025 period
80% of that data will be unstructured
and 49% of the whole data will be in public clouds





In terms of market size
Allied Market Research places the NoSQL market at $ 22B in 2026 - up from $ 2.5B in 2018
Good for a CAGR of 31.4% from 2019 to 2026
Driven by increased demand from 
Ecommerce
Web applications
Social gaming

Good for a CAGR of 31.4% from 2019 to 2026






Let’s look at DB-Engines for historical evolution



When looking at the bigger picture
we can see that established SQL
databases are not loosing too much ground 
PostgreSQL (relational database) and MongoDB (Non relational database) are both gaining in popularity




It is clear that $MDB is NOT set to replace relational databases
These are still widely used 
But we have seen that $MDB is leading
the document stores databases by a WIDE margin
And searches for “MongoDB Atlas”
trend upwards over the last 5 years globally







That is a LOT of information
A summary
Non-relational databases are better than relational in some cases (when data amounts are huge, changes over time)
Non-relational databases SCALE way better and are more flexible to real-life data changes





So far so good! Let’s have a look at $MDB itself 
$MDB generates sales (total of $ 138m) from 3 main sources
Atlas related subscriptions ($ 61m and growing 69% YoY)
Other subscriptions ($ 71m and growing 24% YoY)
Services ($ 5m - stable)

$MDB generates sales (total of $ 138m) from 3 main sources




The “Other subscription” is basically the “MongoDB Enterprise Advanced”
product offered by $MDB
This enables businesses to run $MDB on their own infrastructure and customise it very finely to their needs


MongoDB Atlas
is the one providing the growth behind the $MDB story 
It is a cloud Database-As-A-Service that offers a free tier
is fully managed
provides on-demand scaling
and real-time insights







To say that Atlas is an incredible product is an understatement 
“MongoDB Atlas, which launched in June of 2016, is a battle-tested database-as-a-service platform (DBaaS) artfully designed and built by the same team that created and continues to nurture and grow MongoDB...


...MongoDB Atlas is a true blessing to the development community; it provides all of the features of its database counterpart, without the operation and heavy lifting normally required when building new applications, letting you focus on what you do best.” - Nick Parsons
The full review is right here
https://medium.com/@nparsons08/mongodb-atlas-technical-overview-benefits-9e4cff27a75e


You know what is good about $MDB Atlas?
It is a usage-based service
Remember the growth in unstructured DATA? Well, $MDB is growing not only by adding more customers 
BUT ALSO as its current customers have more data to handle





Here is @cameroniadeluca excellent take on the matter
https://twitter.com/cameroniadeluca/status/1204067906503479297?s=20

Financials check 
Sales grew by 39% YoY
to $ 138m per quarter 
Gross margins stand at 69%
Down from 72% in prev. Q.
Loss from operations stood at $50m up from a loss of $ 38m a year earlier
























Disclaimer - This is not investment advice in any form and investors are responsible for conducting their own research before investing.
Sources
✑ Investor presentation
✑ Company website
✑ SimilarWeb
✑ DB Engines
✑ IDC
✑ Allied Market Research
✑ IBM
✑ Guru99
Sources
✑ Investor presentation
✑ Company website
✑ SimilarWeb
✑ DB Engines
✑ IDC
✑ Allied Market Research
✑ IBM
✑ Guru99
✑ Forbes
✑ High Scalability
✑ InfoWorld
✑ Packt
✑ Dataversity
✑ ScaleGrid
✑ Nick Parsons on Medium
✑ PCMag
✑ High Scalability
✑ InfoWorld
✑ Packt
✑ Dataversity
✑ ScaleGrid
✑ Nick Parsons on Medium
✑ PCMag
Hope you liked this thread!
For more content, follow us on Twitter 
Want to get UNDER HYPED companies delivered straight to your inbox
Don’t MISS IT
https://getbenchmark.substack.com




