August 14, 2019

AI Industry Insights

Picking the Right Machine Learning Algorithm

Deep Learning vs. Gradient Boosted Tree

One of the most important decisions when building a machine learning model is choosing the right algorithm. For many problems, there isn’t a simple answer, and it’s not uncommon for machine learning engineers to try multiple algorithms before selecting the best approach. In recent years, deep learning has become the dominant machine learning technique, but it still has competitors. In particular, gradient boosted trees can often produce solutions that are comparable to, if not better than, those from deep learning, and often with less hassle. In this post, we’ll look at the differences between these two popular machine learning techniques and how to pick the right one for your problem.

Machine learning is advancing at an incredible rate, with new state-of-the-art results generated regularly. Of the myriad machine learning techniques, however, deep learning and gradient boosted trees have consistently outperformed their competitors. They are powerful algorithms that have proven themselves across a wide range of problem domains and datasets.

Computer Vision and Natural Language Processing

Deep learning has been particularly successful in the fields of computer vision and natural language processing (NLP). Its performance in these domains is unrivaled, so there’s currently no reason to consider any other technique when working on problems in these fields.

Deep learning works so well here because it addresses something called the “representation problem”: learning useful features from raw inputs that carry little meaning individually. For example, deep convolutional neural networks (CNNs) are a type of neural network architecture commonly used in computer vision. They work by looking at groups of pixels simultaneously, which lets them use the spatial relationships between pixels to learn higher-order concepts like edges and patterns. Contextual representation is just as important in NLP. An individual word in a sentence is not very useful by itself; the surrounding words are necessary to understand its context and derive its meaning.

Deep learning can find patterns in data by combining features that carry little information by themselves. Gradient boosted trees, however, work best on data whose features are individually informative. If a feature doesn’t carry much information on its own, gradient boosted trees will have a tough time finding a good solution. If your problem involves computer vision or NLP, deep learning is the best approach.
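To make the “groups of pixels” idea concrete, here is a minimal pure-Python sketch of the sliding-window operation at the heart of a CNN layer. The function name, the toy image, and the hand-written filter are all illustrative assumptions; a real CNN learns its filter weights from data and stacks many such layers.

```python
def convolve2d(image, kernel):
    """Slide `kernel` over `image` (valid positions only) and sum the products.

    As in most deep learning libraries, the kernel is not flipped, so this is
    technically a cross-correlation.
    """
    k_rows, k_cols = len(kernel), len(kernel[0])
    out_rows = len(image) - k_rows + 1
    out_cols = len(image[0]) - k_cols + 1
    output = []
    for r in range(out_rows):
        row = []
        for c in range(out_cols):
            total = 0
            for i in range(k_rows):
                for j in range(k_cols):
                    total += image[r + i][c + j] * kernel[i][j]
            row.append(total)
        output.append(row)
    return output


# Toy 4x6 grayscale image: dark (0) on the left, bright (1) on the right.
image = [
    [0, 0, 0, 1, 1, 1],
    [0, 0, 0, 1, 1, 1],
    [0, 0, 0, 1, 1, 1],
    [0, 0, 0, 1, 1, 1],
]

# Hand-written vertical-edge filter (an assumption for illustration only).
vertical_edge = [
    [-1, 0, 1],
    [-1, 0, 1],
    [-1, 0, 1],
]

feature_map = convolve2d(image, vertical_edge)
for row in feature_map:
    print(row)
# [0, 3, 3, 0]
# [0, 3, 3, 0]
```

No single pixel says “edge”; the response is nonzero only where the 3x3 window straddles the dark-to-bright boundary, which is the kind of combined signal the section describes.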

Tabular Data

If you’re working with tabular data (e.g., spreadsheet-type data), gradient boosted trees can be an excellent choice. Gradient boosted trees require very little data pre-processing and handle missing data automatically. But there’s an important caveat: deep learning can sometimes solve complex tabular data problems with a higher degree of success than gradient boosted trees. In general, gradient boosted trees work best on tabular data problems whose categorical features have a limited number of possible values. A categorical feature is an input that can take on one of a fixed number of possible values (e.g., high, medium, low). If your problem contains categorical features with tens of thousands of possible values, deep learning is a better choice.
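One quick way to apply this rule of thumb is to count the distinct values in each categorical column before choosing an algorithm. The sketch below is a hedged illustration, not a standard API: the function name, the column names, and the default threshold of 10,000 (one reading of “tens of thousands”) are all assumptions.

```python
def high_cardinality_columns(rows, categorical_columns, threshold=10_000):
    """Return {column: distinct_count} for categorical columns above `threshold`.

    `rows` is a list of dicts; missing (None) values are ignored when counting.
    """
    flagged = {}
    for column in categorical_columns:
        distinct = {row[column] for row in rows if row.get(column) is not None}
        if len(distinct) > threshold:
            flagged[column] = len(distinct)
    return flagged


# Hypothetical toy data: "priority" has 3 levels, "user_id" is unique per row.
levels = ["high", "medium", "low"]
rows = [{"priority": levels[i % 3], "user_id": f"user-{i}"} for i in range(20_000)]

flagged = high_cardinality_columns(rows, ["priority", "user_id"])
print(flagged)  # {'user_id': 20000}
```

A column like `priority` fits gradient boosted trees comfortably, while a flagged column like `user_id` is the kind of feature the paragraph above suggests handing to deep learning.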

Explainability

Deep learning models are notoriously difficult to interpret. It’s common for modern deep learning models to have hundreds of hidden layers and billions of parameters, resulting in very low explainability. Conversely, gradient boosted trees are relatively easy to interpret and have good explainability. Generating feature importance plots from a trained gradient boosted tree model is simple and lets you see directly which features the model relies on most. If interpretability and explainability are requirements for your machine learning model, gradient boosted trees are the best choice.
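To show where feature importance comes from, here is a minimal pure-Python sketch of gradient boosting for squared error, using one-split trees (stumps) and crediting each split’s error reduction to the feature it used. Everything here is an illustrative assumption: the function names, the toy data, and the gain-based importance definition. Production libraries use deeper trees, regularization, and offer several importance measures.

```python
def fit_stump(X, residuals):
    """Return the one-split tree (stump) that most reduces squared error."""
    mean = sum(residuals) / len(residuals)
    base_error = sum((r - mean) ** 2 for r in residuals)
    best = None  # (gain, feature, threshold, left_value, right_value)
    for feature in range(len(X[0])):
        # Candidate thresholds: every distinct value except the largest,
        # so both sides of the split are non-empty.
        for threshold in sorted({row[feature] for row in X})[:-1]:
            left = [r for row, r in zip(X, residuals) if row[feature] <= threshold]
            right = [r for row, r in zip(X, residuals) if row[feature] > threshold]
            left_value = sum(left) / len(left)
            right_value = sum(right) / len(right)
            error = (sum((r - left_value) ** 2 for r in left)
                     + sum((r - right_value) ** 2 for r in right))
            gain = base_error - error
            if best is None or gain > best[0]:
                best = (gain, feature, threshold, left_value, right_value)
    return best


def fit_boosted_stumps(X, y, n_rounds=20, learning_rate=0.5):
    """Fit stumps to residuals round by round; return model and importances."""
    base = sum(y) / len(y)
    predictions = [base] * len(y)
    stumps = []
    importance = [0.0] * len(X[0])
    for _ in range(n_rounds):
        # For squared error, the negative gradient is simply the residual.
        residuals = [target - pred for target, pred in zip(y, predictions)]
        stump = fit_stump(X, residuals)
        if stump is None:  # no feature has more than one distinct value
            break
        gain, feature, threshold, left_value, right_value = stump
        importance[feature] += gain
        stumps.append((feature, threshold, left_value, right_value))
        predictions = [
            pred + learning_rate * (left_value if row[feature] <= threshold else right_value)
            for row, pred in zip(X, predictions)
        ]
    total = sum(importance) or 1.0
    return base, stumps, [score / total for score in importance]


# Toy data (hypothetical): the target depends only on feature 0;
# feature 1 is a deterministic distractor.
X = [[x, (x * 7) % 5] for x in range(10)]
y = [3.0 * row[0] for row in X]

base, stumps, importance = fit_boosted_stumps(X, y)
print([round(score, 3) for score in importance])
```

Because every split is a readable rule of the form “feature <= threshold”, the normalized scores can be plotted as a bar chart, and on this toy data feature 0 receives most of the credit.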

Speed

Interestingly, deep learning and gradient boosted trees take roughly the same amount of time to train. However, gradient boosted trees will almost certainly be faster at inference time, once they’ve been trained. For problems where low model inference latency is required, gradient boosted trees should be preferred.
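If latency matters for your problem, it is worth measuring it directly on your own models rather than relying on a rule of thumb. The helper below is a small sketch using only the Python standard library; `median_latency_ms` and `toy_predict` are hypothetical names, and `toy_predict` merely stands in for a trained model’s single-row prediction function.

```python
import statistics
import time


def median_latency_ms(predict, example, warmup=10, repeats=200):
    """Median wall-clock time of predict(example), in milliseconds."""
    for _ in range(warmup):  # let caches and lazy initialization settle
        predict(example)
    timings = []
    for _ in range(repeats):
        start = time.perf_counter()
        predict(example)
        timings.append((time.perf_counter() - start) * 1000.0)
    return statistics.median(timings)


# Hypothetical stand-in for a trained model's single-row predict function.
def toy_predict(features):
    return sum(weight * value for weight, value in zip((0.2, -0.1, 0.7), features))


latency = median_latency_ms(toy_predict, (1.0, 2.0, 3.0))
print(f"median latency: {latency:.4f} ms")
```

Running the same helper against both candidate models, on the hardware you plan to deploy to, gives a like-for-like comparison. The median is used because it is less sensitive to occasional slow calls than the mean.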

Trade-Offs

Deep Learning Pros:

Unsurpassed in computer vision and NLP
Handles extremely large input feature spaces
Works well in most problem domains

Deep Learning Cons:

Computationally expensive
Often requires large amounts of training data
Hyperparameter tuning can be difficult
Very low explainability
Difficult to troubleshoot when training fails

Gradient Boosted Tree Pros:

Easy to interpret and explain
Very little data pre-processing required
Handles missing data well

Gradient Boosted Tree Cons:

Prone to overfitting
Not well-suited for some problem domains (e.g. images and text)

Summary

Use Deep Learning when:

Working in computer vision or NLP
Input features are hard to represent
Explainability isn’t important
Speed of the trained model is less important

Use Gradient Boosted Trees when:

Using tabular data
Features can be easily represented
Explainability is important
Speed of the trained model is important
