Is there a powerful database system for time series data?

DatabasePerformanceTime Series

Database Problem Overview


In multiple projects we have to store, aggregate, evaluate simple measurement values. One row typcially consists of a time stamp, a value and some attributes to the value. In some applications we would like to store 1000 values per second and more. These values must not only be inserted but also deleted at the same rate, since the lifetime of a value is restricted to a year or so (in different aggregation steps, we do not store 1000/s for the whole year).

Until now, we have developped different solutions. One based on Firebird, one on Oracle and one on some self-made storage mechanism. But none of these are very satisfying solutions.

Both RDBMS solutions cannot handle the desired data flow. Besides that, the applications that deliver the values (e.g. device drivers) cannot be easily attached to databases, the insert statements are cumbersome. And finally, while having an SQL interface to the data is strongly desired, typical evaluations are hard to formulate in SQL and slow in the execution. E.g. find the maximum value with time stamp per 15 minutes for all measurements during the last month.

The self-made solution can handle the insertion rate and has a client-friendly API to do it, but it has nothing like a query language and cannot be used by other applications via some standard interface e.g. for reporting.

The best solution in my dreams would be a database system that:

  • has an API for very fast insertion
  • is able to remove/truncate the values in the same speed
  • provides a standard SQL interface with specific support for typical time series data

Do you know some database that comes near those requirements or would you approach the problem in a different way?

Database Solutions


Solution 1 - Database

Most other answers seem to mention SQL based databases. NoSQL based databases are far superior at this kind of thing.

Some Open source time-series databases:

Cloud-based:

Solution 2 - Database

influxdb :: An open-source distributed time series database with no external dependencies.

Solution 3 - Database

Consider IBM Informix Dynamic Server with the TimeSeries DataBlade.

That is, however, an extreme data rate that you are working with. (Not quite up to sub-atomic physics at CERN, but headed in that general direction.)


Fair disclosure: I work for IBM on the Informix DBMS, though not on the TimeSeries DataBlade per se.

Solution 4 - Database

Solution 5 - Database

You can try HDF5 for time series data. It is extremely fast for such applications.

Solution 6 - Database

As Jonathan Leffler said, you should try Informix Timeseries feature. It is included in all editions of Informix at no additional charge. You can take a look at the TimeSeries functions it supports:

IBM Informix Time series SQL routines

You can access the data through sql functions or virtual view interfaces, you can even insert into the view.

Attributions

All content for this solution is sourced from the original question on Stackoverflow.

The content on this page is licensed under the Attribution-ShareAlike 4.0 International (CC BY-SA 4.0) license.

Content TypeOriginal AuthorOriginal Content on Stackoverflow
QuestionKit FistoView Question on Stackoverflow
Solution 1 - DatabaseJoakimView Answer on Stackoverflow
Solution 2 - DatabaseA.N.View Answer on Stackoverflow
Solution 3 - DatabaseJonathan LefflerView Answer on Stackoverflow
Solution 4 - DatabaseMitch WheatView Answer on Stackoverflow
Solution 5 - DatabaseshoonyaView Answer on Stackoverflow
Solution 6 - DatabaseGeorgeStoneView Answer on Stackoverflow