In the first post in this series, we considered the relationship between Data Science and the Digital Thread. In this post, we will discuss what we need to put these ideas into practice. The first need, obviously, is data. We want all the data about the system and a way to get to it. For that, I’m going to use Syndeia™, the Digital Thread platform from Intercax. It builds and manages a network of connections between models in the engineering applications shown in Figure 1, as well as structured data in formats such as XML, SQL and CSV. It also provides a mechanism to get to that data.

Figure 1  The Digital Thread integrations in Syndeia Release 3.4

The good news for Data Science is that all this data is structured. The bad news is that there are a lot of different structures. Our second need is a common data model that allows us as data scientists to structure our queries across the entire Digital Thread, that is, the full dataset. For Syndeia, we use four element types: Repositories, Containers, Artifacts and Relations, as shown in Figure 2. All data is treated as one of these elements or their attributes.

Figure 2 Common Data Model in Syndeia
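
To make the four element types concrete, here is a minimal Python sketch of how such a common data model might be represented. This is not Syndeia's actual schema or API. The field names (`key`, `repository_key`, `container_key`, `source_key`, `target_key`), the assumed nesting of Artifacts in Containers and Containers in Repositories, and the sample data are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List


@dataclass
class Element:
    """Common base: every element has an identity, a name and attributes."""
    key: str
    name: str
    attributes: Dict[str, str] = field(default_factory=dict)


@dataclass
class Repository(Element):
    """A data source such as a modeling tool or database (assumed meaning)."""


@dataclass
class Container(Element):
    """A grouping inside a repository, e.g. a project (assumed meaning)."""
    repository_key: str = ""


@dataclass
class Artifact(Element):
    """An individual item inside a container, e.g. a requirement or part."""
    container_key: str = ""


@dataclass
class Relation(Element):
    """A directed connection between two artifacts."""
    source_key: str = ""
    target_key: str = ""


def related_artifacts(artifact_key: str,
                      relations: List[Relation],
                      artifacts: List[Artifact]) -> List[Artifact]:
    """Return the artifacts reached by relations starting at artifact_key."""
    by_key = {a.key: a for a in artifacts}
    return [by_key[r.target_key] for r in relations
            if r.source_key == artifact_key and r.target_key in by_key]


# Hypothetical sample data: one requirement connected to one part.
artifacts = [
    Artifact(key="a1", name="Stopping Distance Requirement", container_key="c1"),
    Artifact(key="a2", name="Brake Assembly", container_key="c2"),
]
relations = [
    Relation(key="r1", name="satisfiedBy", source_key="a1", target_key="a2"),
]

print([a.name for a in related_artifacts("a1", relations, artifacts)])
```

Because every element shares the same base shape, a single query function like `related_artifacts` can work regardless of which tool the underlying data came from, which is the point of a common data model.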

Third, we need the analysis tools. Our objective at Intercax is to make the data available and let data scientists use the tools they already know and love. This series will consider two widely used open-source Data Science tools, TinkerPop graph analysis and Jupyter Notebooks (Figure 3).

TinkerPop originated in 2009 as an open-source software project and is now managed by the Apache Software Foundation. It offers a common interface supported by many open-source and proprietary graph databases, along with a common graph traversal language, Gremlin, for searching and querying.
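
To give a feel for what a Gremlin-style query does, the sketch below mimics a two-step traversal in plain Python, with no graph database required. The vertex names and edge labels (`satisfiedBy`, `realizedBy`) are hypothetical, and the helper `out` is only a stand-in for Gremlin's `out()` step, not part of TinkerPop.

```python
from collections import defaultdict

# Hypothetical edge list: (source vertex, edge label, target vertex).
EDGES = [
    ("REQ-1", "satisfiedBy", "Block-A"),
    ("REQ-2", "satisfiedBy", "Block-B"),
    ("Block-A", "realizedBy", "Part-100"),
    ("Block-B", "realizedBy", "Part-200"),
]


def build_index(edges):
    """Index outgoing edges as {source: {label: [targets]}}."""
    index = defaultdict(lambda: defaultdict(list))
    for source, label, target in edges:
        index[source][label].append(target)
    return index


def out(index, starts, label):
    """Follow outgoing edges with the given label from every start vertex."""
    return [target
            for start in starts
            for target in index.get(start, {}).get(label, [])]


index = build_index(EDGES)

# Roughly equivalent Gremlin traversal (labels are hypothetical):
#   g.V().has('name', 'REQ-1').out('satisfiedBy').out('realizedBy').values('name')
parts = out(index, out(index, ["REQ-1"], "satisfiedBy"), "realizedBy")
print(parts)
```

Each call to `out` corresponds to one hop along the graph, so chaining two calls walks from a requirement to the block that satisfies it and on to the part that realizes that block.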

Our second tool will be the Jupyter Notebook, a web-based interactive computational notebook that emerged out of the IPython open-source project in 2014. A notebook consists of a set of ordered cells containing API calls, computation, text and visualization, and it can draw on many open-source data science libraries. We will use the Python language in our examples, although Jupyter supports other languages as well.

Figure 3 Two Open-Source tools for Data Science

In Part 3 (forthcoming), we will dive more deeply into data structures in our Digital Thread. The final posts will demonstrate some of these ideas in action.
