Starting a Data Model With Repods

Repods is a data platform that can create and manage data pods. These pods are compact data warehouses with flexible storage, vCores, memory, and all required tooling. You can manage personal data projects, work together in a private team, or collaborate on open data in public data pods.

Before we start

Before creating a data pod, it is important to be aware of the scope of information that we have and need for our analysis. The goal is to create a data model that closely reflects the business entities of the subject area, without focusing on how reports are going to be created or how we are going to fill this data model with the given data. A good place to start is by answering the following questions:

How to Evaluate Data Platforms for Your Organization

Introduction

Companies and organizations generate data and are increasingly using this data to generate additional values. While traditionally this was a task for business administration analysts, today data plays an important role in all aspects and divisions of the organizations. To enable companies for this change, an efficient and long term data architecture is required. Here we are going to discuss many aspects and technical challenges that need to be addressed to build such a data architecture.

The motivation for this article came from the observation that data platforms often are reduced to the database component which is a huge oversimplification of the whole data lifecycle. We want to illustrate the amount of functionality that is required for a basic, long term data strategy in a company.