Selecting a Database for Your Data Needs

By Deepak Vohra - August 27, 2019

Data requirements vary, and with so many different types of databases, selecting which to use can be a tricky choice. Let’s look at the popular database types so you can make the decision about which would be best for your situation.

Relational Database

In a relational database, data is stored in a table format with rows and columns. A table schema typically defines the data definitions, including column names, column types, primary key, foreign key, and any constraints, like that a column value must be unique or must not be null. Select a relational database, such as MySQL or PostgreSQL, if the data to be stored is structured and has relationships with other data.

NoSQL Database

If the data to be stored is not relational, a NoSQL database is the best option. NoSQL does not imply no SQL is used at all, but instead refers to the non-relational aspect of the database, with no support for a foreign key, joins, or referential integrity. Most NoSQL databases support a SQL-like query language. A NoSQL database typically has features such as schema-free and unstructured data, and they’re further categorized based on the type of storage, such as document and key-value. An example of a document database is MongoDB, and an example of a key-value database is Apache Cassandra.

Search Engine

A search engine is a database that stores and indexes data to make it queryable. Search engines are also typically non-relational, but their salient characteristic is that they are optimized for search, with features such as support for complex search expressions using regular expressions, full text search, and grouping of search results. Apache Lucene is an open source, full-text search library commonly used by search engines, including Elasticsearch and Apache Solr.

Graph Database

A graph database stores data in a graph structure, which has nodes and edges representing relationships between nodes. Graphs are used for data that has several relationships because they provide simple and efficient querying of highly interconnected data. The query language used by graph databases is different from SQL because of the complexity of the queries, and some languages have been developed just for querying a graph databases, such as Cypher. Examples of graph databases are Neo4j and ArangoDB.

In-Memory Database

An in-memory database stores data in the process memory instead of on a disk, which allows for faster response time. But a disadvantage is that a server or process failure would cause the data to be lost. Examples of open source in-memory databases are Redis and ArangoDB.

Multi-Model Database

A multi-model database supports more than one data storage model, offering flexibility in storage. Some of the databases mentioned previously are multi-model databases: MySQL and PostgreSQL are also document stores, and ArangoDB is a multi-model database with support for in-memory, document, and key-value storage.

Choose which database to use based on what your database will be used for, the data structure, and the scale of your data.

Tags

data storage

0 comments

Deepak is a Sun Certified Java Programmer and Web Component Developer, and has worked in the fields of XML, Java programming and Java EE for ten years. Deepak is the co-author of the Apress book Pro XML Development with Java Technology and was the technical reviewer for the O'Reilly book WebLogic: The Definitive Guide. Deepak was also the technical reviewer for the Course Technology PTR book Ruby Programming for the Absolute Beginner. Deepak is also the author of the Packt Publishing books JDBC 4.0 and Oracle JDeveloper for J2EE Development, Processing XML Documents with Oracle JDeveloper 11g, EJB 3.0 Database Persistence with Oracle Fusion Middleware 11g, and Java EE Development in Eclipse IDE. Deepak is a Docker Mentor and has published 5 books on Docker and Kubernetes.