Blogs

Home > Blogs

Analyzing Open-Source Package Sustainability: Part 4 – From Clustering to Classification

In this blog, we'll walk you through our transition to supervised classification, share how we tackled some intriguing challenges, and reveal how we expanded our model to handle multiple ecosystems. Whether you're a data science enthusiast or a curious ML developer, there's something here for everyone!

Read More

Analyzing Open-Source Package Sustainability: Part 3 – Focusing on Data Preprocessing

Effective data preprocessing is key to reducing outliers and unlocking the true potential of open-source sustainability insights.

In this blog we’ll walk you through cleaning and scaling the collected data in order to address issues like missing or inconsistent information, transform data into a suitable format, and create composite metrics to better assess the sustainability of open-source packages.


Read More

Analysing Open-Source Packages Sustainability using Machine Learning: Part-1: Introduction

In today’s fast-paced digital world, managing software dependencies is crucial to avoid security risks and technical debt. The Package Sustainability Scanner (PSS), powered by machine learning and data modeling, evaluates the long-term viability of open-source packages from the ecosystems such as PyPI and npm by analyzing maintainability, engagement and community support.

Read More