Regression Example - Lightgbm Typing CST Test
Loading…
Regression Example — Lightgbm Code
Simple regression example with LightGBM.
import lightgbm as lgb
import numpy as np
from sklearn.datasets import make_regression
from sklearn.model_selection import train_test_split
from sklearn.metrics import mean_squared_error
X, y = make_regression(n_samples=100, n_features=3, noise=0.1, random_state=42)
X_train, X_test, y_train, y_test = train_test_split(X, y, test_size=0.2)
train_data = lgb.Dataset(X_train, label=y_train)
params = {'objective':'regression','metric':'rmse'}
model = lgb.train(params, train_data, num_boost_round=100)
y_pred = model.predict(X_test)
print('RMSE:', np.sqrt(mean_squared_error(y_test, y_pred)))Lightgbm Language Guide
LightGBM (Light Gradient Boosting Machine) is a fast, distributed, high-performance gradient boosting framework based on decision tree algorithms, used for ranking, classification, and many other machine learning tasks.
Primary Use Cases
- ▸Binary and multiclass classification
- ▸Regression problems
- ▸Ranking tasks (learning-to-rank)
- ▸Feature selection and importance analysis
- ▸Integration in ML pipelines for large-scale structured data
Notable Features
- ▸Faster training with histogram-based decision tree algorithm
- ▸Low memory usage compared to XGBoost
- ▸Supports parallel and GPU learning
- ▸Handles categorical features directly
- ▸Scales efficiently with large datasets
Origin & Creator
LightGBM was developed by Microsoft’s DMTK team and released in 2016 to provide a faster and more memory-efficient gradient boosting framework compared to existing solutions.
Industrial Note
LightGBM is widely used in Kaggle competitions, finance, advertising, recommendation systems, and any scenario requiring high-speed gradient boosting on large datasets.