Insights and Case Studies

The latest thought leadership, trends and research from ProcessMiner

A note about our case studies: Delve deep into the world of pulp-and-paper manufacturing with a wealth of knowledge and thought leadership from our Director of Science, Chitta Ranjan, who shares his extensive research on data in the industry.

Importance Of Autonomous Manufacturing Amid The COVID-19 Outbreak

All manufacturers are adjusting to the new normal and with “Social Distancing” in full swing companies are making more aggressive investments in technology to achieve autonomous manufacturing.  This approach accomplishes a couple of important objectives…

Read More

Here we will learn the details of data preparation for LSTM models, and build an LSTM Autoencoder for rare-event classification. This post is a continuation of my previous post-Extreme Rare Event Classification using Autoencoders. In the previous post, we talked about the challenges in an extremely rare event data with less than 1% positively labeled data.

Read Case Study

In a rare-event problem, we have an unbalanced dataset. Meaning, we have fewer positively labeled samples than negative. In a typical rare-event problem, the positively labeled data are around 5–10% of the total. In an extremely rare event problem, we have less than 1% positively labeled data.

Read Case Study

Correlation estimations are commonly used in various data mining applications. In my experience, nonlinear correlations are quite common in various processes. Due to this, nonlinear models, such as SVM, are employed for regression, classification, etc.

Read Case Study

The Industrial Revolution paved the way for the life we know today and far surpassed the era of simplistic conveyor belts and heavy manual surveillance. Production lines employ machinery and humans alike. Industries have always stepped up with technological advancement and thereby use a plethora of devices designed and produced to meet specific tasks on factory grounds.

Read More

A real-world dataset is provided from the pulp-and-paper manufacturing industry. The dataset comes from a multivariate time series process. The data contains a rare event of paper break that commonly occurs in the industry. The data contains sensor readings at regular time-intervals (x’s) and the event label (y).

Read More

Here we will learn an approach to get vector embeddings for string sequences. These embeddings can be used for Clustering and Classification. Sequence modeling has been a challenge. This is because of the inherent un-structuredness of sequence data. Just like texts in Natural Language Processing (NLP), sequences are arbitrary strings.

Read Case Study

Researchers are often looking for interesting real-world problems. One major roadblock they face is real-world data. Here we are trying to serve the research community by providing a real-world problem and a dataset. This dataset is created from the pulp-and-paper manufacturing industry. Paper sheets are not the strongest material.

Read Case Study

A real-world dataset is provided from the pulp-and-paper manufacturing industry. The dataset comes from a multivariate time series process. The data contains a rare event of paper break that commonly occurs in the industry. The data contains sensor readings at regular time-intervals (x’s) and the event label (y).

Read Case Study