Learn data science best practices.


Sahiba Chopra | February 7, 2019

An Introduction To Building a Classification Model Using Random Forests In Python

A random forest is an ensemble machine learning algorithm that is used for classification and regression problems. In this tutorial, learn how to build a random forest, use it to make predictions,...

Setareh Borjian | February 4, 2019

The Benefits of a Hybrid Software Solution

Oracle Principal Data Scientist Setareh Borjian looks at the disadvantages of a pre-packaged software solution and considers the practicality of a hybrid solution.

Gautam Singaraju | January 31, 2019

Introduction to Embedding in Natural Language Processing

Oracle Digital Assistant Architect and Software Lead Gautam Singaraju, Ph.D., gives an introduction to word embeddings along with techniques that embed higher dimensional artifacts into low...

Joe Hahn | January 30, 2019

Predictive Maintenance for Upstream Oil and Gas

Oracle Data Scientist Joe Hahn presents a toy-model simulation to assess a predictive maintenance strategy that can be applied to upstream oil and gas.

Sebastian Neubauer | January 28, 2019

Why Is It so Hard to Put Data Science in Production?

Is data science helping your company build the systems that automate operational decisions? If not, Senior Data Scientist Sebastian Neubauer believes that adopting a DevOps mentality in your data...

Mats Stellwall | January 24, 2019

Overview of Traditional Machine Learning Techniques

Oracle Data Scientist Mats Stellwall goes over four techniques considered part of traditional machine learning: clustering, classification, regression, and market basket analysis.

Manish Bhoge | January 23, 2019

Using the Artificial Neural Network for Credit Risk Management

Oracle Financial Software Services Solution Architect Manish Bhoge introduces the Artificial Neural Network and how it can be used in credit risk analysis.

Shashank Shekhar Rai | January 17, 2019

5 Data Cleaning Tips to Test Assumptions

In this post, machine learning practitioner Shashank Shekhar Rai offers five tips that any data scientist or analyst can use as data checks and a way to second-guess any assumptions that may creep...

Gyasi Dapaa | January 16, 2019

Target Twisted: Avoid Creating Biases in Loss Cost Modeling

Actuarial rate-making thought leader Gyasi Kwabena Dapaa explains why actuarial trend rates are not suitable for trending target loss variables of insurance predictive models.

Vikram Reddy | January 14, 2019

A Business Perspective to Designing an Enterprise-Level Data Science Pipeline

Oracle Senior Data Scientist Vikram Reddy walks through a case study and illustrates key considerations when designing a data science pipeline.