Store All Data and Build Trust in It



Many types of data flow in and out of organizations, driving decisions and operations. Datanova Trusted Data Lake™ helps organizations face three big challenges:

  1. Storing and managing all this disparate information effectively
  2. Connecting and making sense of disparate data streams
  3. Quantifying the trust that can be placed in the data, and in any analysis on the data

Datanova Trusted Data Lake™ (TDL) applies rich management, trust building, evidence tracking, and optimization to all data in a secure enterprise cloud. All types of data come together to form a knowledge base for the organization. The data can be securely distributed to any type of user. Trusted Data Lake makes the data easy to understand, and drives analysis and compliance applications.


TDL Can Store All Data Types at Exabyte Scale

TDL can pull data from various databases into one, reducing the need for multiple licenses. TDL can also pull in files such as PDFs and images. All this data is managed under one umbrella. Data sources can be linked to each other, and to knowledge models, to create new intelligence. Notably, multiple datasets can be corrected and fused to create a 'golden dataset' that serves as a master reference dataset.

Evidence building and trust are enabled by pedigree and provenance. Provenance is the full chain of custody for a datum, and pedigree is the measure of trust in the correctness of that datum. Datanova TDL has built-in pedigree and provenance models. These customizable knowledge models can be tuned to an organization's specific needs.

Many modern organizations have legal, security, and compliance obligations. At the same time, they need to enable unhindered storage, analysis, and collaboration. Datanova TDL's patent-pending technology bridges this divide.


The Data Lake for Professional Grade Use

This is the Big Data solution we have been waiting for.

- Intelligence Community Branch Chief


Optimize All Data

Modern corporations need to leverage both traditional data assets (reports, relational data) and non-traditional ones (text, social media, web, sensor, third-party data, etc.) to stay competitive. Effortlessly ingest and exploit structured, unstructured, and semi-structured data. Connect to NoSQL and traditional databases, reports, PDFs, spreadsheets, and streaming data. Put all data in one silo-less system.

Create Golden Datasets

Fuse entities that are distributed and duplicated across multiple databases. Create authoritative ‘Golden Datasets’. For example, there may be many databases that refer to people (customers, call-logs, accounts…). These can be fused to form a dataset of ‘people we interact with’ that is enriched from all sources.
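As a rough illustration of the idea, the sketch below fuses person records from three sources into one enriched record per person. It is not Datanova's fusion method: the sample records, the choice of email as the matching key, and the helper names normalize_key and fuse are all assumptions made for this example.

```python
from collections import defaultdict

# Hypothetical records from three source systems (all values are made up).
customers = [{"email": "Ana@Example.com", "name": "Ana Ruiz"}]
call_logs = [{"email": "ana@example.com ", "phone": "555-0100"}]
accounts = [{"email": "bo@example.com", "name": "Bo Chen", "account_id": "A-17"}]


def normalize_key(record):
    """Assumed matching rule: lowercased, trimmed email address."""
    return record["email"].strip().lower()


def fuse(sources):
    """Merge records that share a key; the first non-empty value per field wins."""
    golden = defaultdict(dict)
    for source_name, records in sources.items():
        for record in records:
            key = normalize_key(record)
            entity = golden[key]
            entity.setdefault("email", key)
            for field_name, value in record.items():
                if field_name != "email" and value:
                    entity.setdefault(field_name, value)
            # Remember which sources contributed to this entity.
            entity.setdefault("sources", []).append(source_name)
    return dict(golden)


people = fuse({"customers": customers, "call_logs": call_logs, "accounts": accounts})
for person in people.values():
    print(person)
```

In this sketch the two records for Ana collapse into one entity carrying both a name and a phone number. A production system would typically need fuzzier matching and explicit conflict-resolution rules.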

Secure and Distribute

Use intelligence-community grade security to distribute data without worry. Customize access at the datum level, data-model level, or organization level. Use RBAC or ABAC to manage data access policies. Secure data at rest and in motion.
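To make the difference between the two policy styles concrete, here is a minimal sketch that contrasts a role check (RBAC) with an attribute check (ABAC) at the datum level. It does not use any Datanova API. The User and Datum classes, the clearance and sensitivity attributes, and the rule itself are hypothetical.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class User:
    name: str
    roles: frozenset
    clearance: int
    department: str


@dataclass(frozen=True)
class Datum:
    value: str
    sensitivity: int
    owner_department: str


def rbac_allows(user, required_role):
    """RBAC: access depends only on the roles the user holds."""
    return required_role in user.roles


def abac_allows(user, datum):
    """ABAC: access depends on attributes of both the user and the datum."""
    return (user.clearance >= datum.sensitivity
            and user.department == datum.owner_department)


analyst = User("analyst", frozenset({"reader"}), clearance=2, department="finance")
salary = Datum("salary record", sensitivity=3, owner_department="finance")
invoice = Datum("invoice", sensitivity=1, owner_department="finance")

print(rbac_allows(analyst, "reader"))  # role-based decision
print(abac_allows(analyst, invoice))   # attribute-based decision, low sensitivity
print(abac_allows(analyst, salary))    # attribute-based decision, high sensitivity
```

The role check gives the same answer for every datum, while the attribute check can differ from one datum to the next, which is what datum-level access control requires.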

Build Trust and Track Evidence

Maintain full situational awareness of your data. Track data as it flows through the enterprise: ingestion, transformations, usage, and analysis. A customizable Provenance ontology keeps track of the “who, what, how, and why” as data is used. A customizable Pedigree ontology keeps track of the impact on data trust and quality as data is manipulated.
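One way to picture the two ideas together is a datum that carries its own history and trust score. The sketch below is an assumption-laden illustration and not the Datanova ontologies: the ProvenanceEvent and TrackedDatum classes, the 0-to-1 pedigree score, and the multiplicative trust_factor are all invented for this example.

```python
from dataclasses import dataclass, field


@dataclass
class ProvenanceEvent:
    who: str   # actor that touched the datum
    what: str  # what was done
    how: str   # method or tool used
    why: str   # stated purpose


@dataclass
class TrackedDatum:
    value: float
    pedigree: float = 1.0  # assumed trust score in [0, 1]
    provenance: list = field(default_factory=list)

    def apply(self, new_value, event, trust_factor):
        """Record the event and scale the trust score by trust_factor."""
        self.value = new_value
        self.pedigree = max(0.0, min(1.0, self.pedigree * trust_factor))
        self.provenance.append(event)


reading = TrackedDatum(value=20.4, pedigree=0.9)
reading.apply(
    20.0,
    ProvenanceEvent("etl-job", "rounded value", "round to integer", "report formatting"),
    trust_factor=0.5,
)
reading.apply(
    68.0,
    ProvenanceEvent("analyst", "unit conversion", "Celsius to Fahrenheit", "US report"),
    trust_factor=1.0,
)

for event in reading.provenance:
    print(event.who, "|", event.what, "|", event.how, "|", event.why)
print("pedigree:", reading.pedigree)
```

The provenance list answers who did what, how, and why, while the pedigree value records how each manipulation affected trust in the result.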

Knowledge Models and Semantics

Future-proof organizational and operational knowledge in expressive knowledge models. While databases and data models change with technology, knowledge models change with the business domain. Capture expert knowledge and make it available to all users. Datanova's knowledge models are more expressive than those of any competing product.

Lower Costs

Datanova’s advanced knowledge modeling technology drastically lowers the development cost for enterprise and big-data systems.

Multiple Distribution Modes

Organizations of all sizes can allow users to access data and knowledge through a web browser, and enable them to analyze and understand this information efficiently. Data can also be exported using modern bulk or streaming formats. Choose from a variety of distribution standards.

Single-Point Governance

Manage all your data in one silo-less system. Set customizable policies for security, interoperation, and optimization.

Link and Enrich

Create new intelligence by linking datasets to one another. Create and manage new types of relationships.

Data Quality

Improve data quality with data profilers and correctors. Set up automated data quality analysis and remediation.
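As a simple picture of what a profiler and a corrector do, the sketch below counts missing and distinct values per column, applies a basic cleanup, and profiles again. It is a generic illustration rather than Datanova's tooling. The profile and correct functions and the sample rows are hypothetical.

```python
def profile(rows):
    """Count missing and distinct values per column."""
    columns = {key for row in rows for key in row}
    report = {}
    for column in sorted(columns):
        values = [row.get(column) for row in rows]
        present = [v for v in values if v not in (None, "")]
        report[column] = {
            "missing": len(values) - len(present),
            "distinct": len(set(present)),
        }
    return report


def correct(rows):
    """Trim whitespace in string values and turn empty strings into None."""
    cleaned = []
    for row in rows:
        fixed = {}
        for key, value in row.items():
            if isinstance(value, str):
                value = value.strip() or None
            fixed[key] = value
        cleaned.append(fixed)
    return cleaned


# Made-up sample data with stray whitespace and missing values.
rows = [
    {"name": " Ana ", "city": "Lima"},
    {"name": "Ana", "city": ""},
    {"name": "Bo", "city": None},
]

before = profile(rows)
after = profile(correct(rows))
print("before:", before)
print("after: ", after)
```

Comparing the two reports shows the effect of remediation: the padded and unpadded spellings of the same name no longer count as distinct values after correction.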

Exabyte Scale

Leverage the true power of big data. Persist and manage data at internet scale. Datanova’s DSO™ analysis engine uses the power of cluster computing to join and analyze big data.

Open Standards and Open Architecture

Datanova technology follows open W3C standards – preventing vendor lock-in and increasing the utility of work done.


Summary Specification

OPERATING SYSTEMS
  • Red Hat Enterprise Linux 5 (x64)
  • Red Hat Enterprise Linux 6 (x64)
  • CentOS 5 (x64)
  • Microsoft Windows Server 2008 R2+
  • Microsoft Windows 7, 8, 8.1 64-bit
ANALYSIS
  • Datanova Data Science Office™
  • Microsoft Power BI™
  • Tableau™
  • Qlik™
INGESTED SOURCES
  • Relational - MySQL, PostgreSQL, MS SQL Server, Oracle, JDBC/ODBC, SQLite, Azure
  • NoSQL - MongoDB, Accumulo, ElasticSearch
  • File systems - HDFS, Linux, and Windows
  • ASCII, XML, JSON, JSON-LD, RDF TTL/other, CSV/TSV, Spark
  • Excel™ XLS/XLSX/XLSM, PDF, HTM/HTML/XML, MDB/ACCDB
  • Qlik QVX, SAS SAS7BDAT, Tableau TDE
  • Spatial Data Files
VIRTUALIZED ENVIRONMENTS
  • Microsoft Azure
  • Amazon EC2
  • VMware ESX (private cloud deployments)
BIG COMPUTING
  • Hadoop Enabled: YES
  • Spark Enabled: YES
MIN HARDWARE
  • 8GB RAM
  • 30GB Disk Space
  • 2 Core CPU




     Call - 1.877.619.6682

     Email - info@datanovasci.com