NoSQL Tutorial: Types of NoSQL Databases & Example
โก Slimme samenvatting
NoSQL is a non-relational database management system that does not require a fixed schema, avoids joins, and scales easily. This resource explains what NoSQL is, why it exists, its history, features, the four database types, the CAP theorem, eventual consistency, and its advantages and disadvantages.

Wat is NoSQL?
NoSQL-database is a non-relational data management system that does not require a fixed schema. It avoids joins, and is easy to scale. The major purpose of using a NoSQL database is for distributed data stores with humongous data storage needs. NoSQL is used for big data and real-time web apps. For example, companies like Twitter, Facebook, and Google verzamelen elke dag terabytes aan gebruikersgegevens.
NoSQL-database stands for โNot Only SQLโ or โNot SQLโ. Though a better term would be โNoRELโ, NoSQL caught on. Carl Strozzi introduced the NoSQL concept in 1998.
Traditional RDBMS uses SQL syntax to store and retrieve data for further insights. Instead, a NoSQL database system encompasses a wide range of database technologies that can store structured, semi-structured, unstructured, and polymorphic data. Let us understand about NoSQL with a diagram in this NoSQL database tutorial:
Waarom NoSQL?
Het concept van NoSQL-databases werd populair bij internetgiganten zoals Google, Facebook, Amazon, enz. die te maken hebben met enorme hoeveelheden gegevens. De reactietijd van het systeem wordt traag als u RDBMS gebruikt voor enorme hoeveelheden gegevens.
Om dit probleem op te lossen, zouden we onze systemen kunnen โopschalenโ door onze bestaande hardware te upgraden. Dit proces is duur.
The alternative for this issue is to distribute the database load on multiple hosts whenever the load increases. This method is known as โscaling outโ.
NoSQL database is non-relational, so it scales out better than relational databases, as they are designed with web applications in mind.
Korte geschiedenis van NoSQL-databases
- 1998 โ Carlo Strozzi uses the term NoSQL for his lightweight, open-source relational database.
- 2000 โ Graph database Neo4j is launched.
- 2004 - Google BigTable is launched.
- 2005 - CouchDB wordt gelanceerd.
- 2007 โ The research paper on Amazon Dynamo is released.
- 2008 โ Facebook open sources the Cassandra project.
- 2009 โ The term NoSQL was reintroduced.
Kenmerken van NoSQL
Niet-relationeel
- NoSQL-databases volgen nooit de relationeel model.
- Never provide tables with flat fixed-column records.
- Work with self-contained aggregates or BLOBs.
- Do not require object-relational mapping and data normalization.
- No complex features like query languages, query planners, referential integrity joins, or ACID.
Schemavrij
- NoSQL databases are either schema-free or have relaxed schemas.
- Do not require any sort of definition of the schema of the data.
- Offer heterogeneous structures of data in the same domain.

Eenvoudige API
- Offers easy-to-use interfaces for storage and querying data.
- APIs allow low-level data manipulation and selection methods.
- Text-based protocols mostly used with HTTP REST with JSON.
- Mostly used no standard-based NoSQL query language.
- Web-enabled databases running as internet-facing services.
Distributed
- Multiple NoSQL databases can be executed in a distributed fashion.
- Offers auto-scaling and fail-over capabilities.
- Often the ACID concept can be sacrificed for scalability and throughput.
- Mostly no synchronous replication between distributed nodes; asynchronous multi-master replication, peer-to-peer, HDFS replication.
- Only providing eventual consistency.
- Shared-nothing architecture. This enables less coordination and higher distribution.
Soorten NoSQL-databases
NoSQL-databases are mainly categorized into four types: Key-value pair, Column-oriented, Graph-based, and Document-oriented. Every category has its unique attributes and limitations. None of the above-specified databases is better at solving all the problems. Users should select the database based on their product needs.
Soorten NoSQL-databases:
- Sleutel-waarde paargebaseerd
- Kolomgeoriรซnteerde grafiek
- Op grafieken gebaseerd
- Documentgericht
Sleutelwaardepaar gebaseerd
Data is stored in key/value pairs. It is designed in such a way to handle lots of data and heavy load. Key-value pair storage databases store data as a hash table where each key is unique, and the value can be a JSON, BLOB (Binary Large Objects), string, etc.
Een sleutel-waardepaar kan bijvoorbeeld een sleutel bevatten zoals 'Website', gekoppeld aan een waarde zoals 'Guru99 ".
It is one of the most basic NoSQL database examples. This kind of NoSQL database is used as a collection, dictionaries, associative arrays, etc. Key-value stores help the developer to store schema-less data. They work best for shopping Inhoud van de winkelwagen.
Redis, Dynamo, and Riak are some NoSQL examples of key-value store databases. They are all based on Amazon's Dynamopapier.
Op kolommen gebaseerd
Column-oriented databases work on columns and are based on the BigTable paper by Google. Every column is treated separately. Values of single-column databases are stored contiguously.
They deliver high performance on aggregation queries like SUM, COUNT, AVG, MIN, etc., as the data is readily available in a column. Column-based NoSQL databases are widely used to manage data warehouses, business intelligence, CRM, and library card catalogs.
HBase, Cassandra, and Hypertable are NoSQL query examples of column-based databases.
Documentgericht
Document-Oriented NoSQL DB stores and retrieves data as a key-value pair, but the value part is stored as a document. The document is stored in JSON or XML formats. The value is understood by the DB and can be queried.
In this diagram on your left, you can see we have rows and columns, and on the right, we have a document database which has a similar structure to JSON. Now for the relational database, you have to know what columns you have, and so on. However, for a document database, you have a data store like a JSON object. You do not need to define it, which makes it flexible.
The document type is mostly used for CMS systems, blogging platforms, real-time analytics, and e-commerce applications. It should not be used for complex transactions which require multiple operations or queries against varying aggregate structures.
Amazon SimpelDB, CouchDB, MongoDB, Riak, and Lotus Notes are popular document-oriented DBMS-systemen.
Op grafieken gebaseerd
A graph type database stores entities as well as the relations amongst those entities. The entity is stored as a node with the relationship as edges. An edge gives a relationship between nodes. Every node and edge has a unique identifier.
Compared to a relational database where tables are loosely connected, a graph database is multi-relational in nature. Traversing relationships is fast, as they are already captured in the DB, and there is no need to calculate them. Graph base databases are mostly used for social networks, logistics, and spatial data.
Neo4J, Oneindige grafiek, OrientDB, and FlockDB are some popular graph-based databases.
Querymechanismehulpmiddelen voor NoSQL
The most common data retrieval mechanism is the REST-based retrieval of a value based on its key/ID with a GET resource.
Document store databases offer more difficult queries, as they understand the value in a key-value pair. For example, CouchDB allows defining views with MapReduce.
Wat is het CAP-theorema?
CAP theorem is also called Brewerโs theorem. It states that it is impossible for a distributed data store to offer more than two out of three guarantees:
- Consistentie
- Beschikbaarheid
- Partitie tolerantie
Consistentie: De data moet consistent blijven, zelfs na de uitvoering van een bewerking. Dit betekent dat zodra data is geschreven, elke toekomstige leesaanvraag die data moet bevatten. Bijvoorbeeld, na het updaten van de orderstatus, moeten alle clients dezelfde data kunnen zien.
Beschikbaarheid: De database moet altijd beschikbaar en responsief zijn. Het mag geen downtime hebben.
Partitietolerantie: Partitietolerantie betekent dat het systeem moet blijven functioneren, zelfs als de communicatie tussen de servers niet stabiel is. De servers kunnen bijvoorbeeld worden opgedeeld in meerdere groepen die mogelijk niet met elkaar communiceren. Als een deel van de database hier niet beschikbaar is, blijven andere delen altijd onaangetast.
Eventuele consistentie
The term โeventual consistencyโ means to have copies of data on multiple machines to get high availability and scalability. Thus, changes made to any data item on one machine have to be propagated to other replicas.
Data replication may not be instantaneous, as some copies will be updated immediately while others in due course of time. These copies may be mutually inconsistent, but in due course of time, they become consistent. Hence, the name eventual consistency.
BASE: Basisch Abeschikbaar, Svaak staat, Eventuele consistentie
- Basically available means the DB is available all the time as per the CAP theorem.
- Soft state means even without an input, the system state may change.
- Eventual consistency means that the system will become consistent over time.
Voordelen van NoSQL
- Can be used as a primary or analytic data source.
- Big data capability.
- No single point of failure.
- Easy replication.
- No need for a separate caching layer.
- Het biedt snelle prestaties en horizontale schaalbaarheid.
- Can handle structured, semi-structured, and unstructured data with equal effect.
- Object-oriented programming which is easy to use and flexible.
- NoSQL databases do not need a dedicated high-performance server.
- Support key developer languages and platforms.
- Simpler to implement than using RDBMS.
- Het kan dienen als de primaire gegevensbron voor online applicaties.
- Handles big data which manages data velocity, variety, volume, and complexity.
- Excels at distributed database and multi-data center operations.
- Eliminates the need for a specific caching layer to store data.
- Offers a flexible schema design which can easily be altered without downtime or service disruption.
Nadelen van NoSQL
- No standardization rules.
- Limited query capabilities.
- RDBMS databases and tools are comparatively mature.
- Het biedt geen traditionele databasemogelijkheden, zoals consistentie wanneer meerdere transacties tegelijkertijd worden uitgevoerd.
- When the volume of data increases, it is difficult to maintain unique values as keys become difficult.
- Does not work as well with relational data.
- The learning curve is stiff for new developers.
- Open source options are not so popular for enterprises.






