MEBoost: Mixing Estimators with Boosting for Imbalanced Data Classification (1712.06658v2)

Published 18 Dec 2017 in cs.LG and stat.ML

Abstract: The class imbalance problem has been a challenging research problem in machine learning and data mining, as most real-life datasets are imbalanced. Several existing machine learning algorithms try to maximize classification accuracy by correctly identifying majority class samples while ignoring the minority class. However, minority class instances are usually of greater interest than majority class instances. Recently, several cost-sensitive methods, ensemble models, and sampling techniques have been used in the literature to classify imbalanced datasets. In this paper, we propose MEBoost, a new boosting algorithm for imbalanced datasets. MEBoost mixes two different weak learners with boosting to improve performance on imbalanced datasets. MEBoost is an alternative to existing techniques such as SMOTEBoost, RUSBoost, AdaBoost, etc. The performance of MEBoost has been evaluated on 12 benchmark imbalanced datasets against state-of-the-art ensemble methods such as SMOTEBoost, RUSBoost, Easy Ensemble, EUSBoost, and DataBoost. Experimental results show significant improvement over the other methods, and we conclude that MEBoost is an effective and promising algorithm for dealing with imbalanced datasets. The Python version of the code is available here: https://github.com/farshidrayhanuiu/
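
The idea of mixing two weak learners inside a boosting loop can be illustrated with a small sketch. The abstract does not name the two weak learners or say how they are combined, so everything below is an assumption made for illustration: the learners alternate by round, an exhaustive decision stump and a randomized decision stump stand in for the two estimators, and the weight update is the standard discrete AdaBoost rule. All function names (`best_stump`, `random_stump`, `fit_alternating_boost`, `predict`) are hypothetical. This is not the authors' implementation, and it contains no imbalance-specific handling.

```python
import math
import random

# Labels are assumed to be -1 (majority) and +1 (minority).
# A stump is a tuple (feature index, threshold, polarity).

def stump_predict(stump, x):
    feature, threshold, polarity = stump
    return polarity if x[feature] > threshold else -polarity

def weighted_error(stump, X, y, w):
    # Sum of the weights of the misclassified samples.
    return sum(wi for xi, yi, wi in zip(X, y, w) if stump_predict(stump, xi) != yi)

def best_stump(X, y, w, rng):
    # Stand-in learner A (assumption): exhaustive search over every
    # observed feature value as a threshold, with both polarities.
    candidates = [(f, xi[f], p) for f in range(len(X[0])) for xi in X for p in (1, -1)]
    return min(candidates, key=lambda s: weighted_error(s, X, y, w))

def random_stump(X, y, w, rng):
    # Stand-in learner B (assumption): random feature and random threshold;
    # only the polarity is chosen to minimize the weighted error.
    f = rng.randrange(len(X[0]))
    values = [xi[f] for xi in X]
    t = rng.uniform(min(values), max(values))
    return min([(f, t, 1), (f, t, -1)], key=lambda s: weighted_error(s, X, y, w))

def fit_alternating_boost(X, y, rounds=10, seed=0):
    rng = random.Random(seed)
    n = len(X)
    w = [1.0 / n] * n                      # start from uniform sample weights
    learners = [best_stump, random_stump]  # alternation by round is an assumption
    ensemble = []
    for t in range(rounds):
        stump = learners[t % 2](X, y, w, rng)
        # Clamp the error so the log below stays finite.
        err = min(max(weighted_error(stump, X, y, w), 1e-10), 1 - 1e-10)
        alpha = 0.5 * math.log((1 - err) / err)
        ensemble.append((alpha, stump))
        # Discrete AdaBoost update: raise the weights of misclassified samples.
        w = [wi * math.exp(-alpha * yi * stump_predict(stump, xi))
             for xi, yi, wi in zip(X, y, w)]
        total = sum(w)
        w = [wi / total for wi in w]
    return ensemble

def predict(ensemble, x):
    score = sum(alpha * stump_predict(stump, x) for alpha, stump in ensemble)
    return 1 if score >= 0 else -1
```

A toy usage, with 8 majority and 2 minority samples on one feature: `X = [[i / 10] for i in range(10)]`, `y = [-1] * 8 + [1] * 2`, then `model = fit_alternating_boost(X, y)` and `predict(model, [0.95])`. The repository linked in the abstract is the place to look for the actual choice of estimators and mixing rule.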

Authors (7)
  1. Farshid Rayhan (6 papers)
  2. Sajid Ahmed (13 papers)
  3. Asif Mahbub (3 papers)
  4. Md. Rafsan Jani (3 papers)
  5. Swakkhar Shatabda (35 papers)
  6. Chowdhury Mofizur Rahman (6 papers)
  7. Dewan Md. Farid (7 papers)
Citations (23)