Papers
Topics
Authors
Recent
Assistant
AI Research Assistant
Well-researched responses based on relevant abstracts and paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses.
Gemini 2.5 Flash
Gemini 2.5 Flash 68 tok/s
Gemini 2.5 Pro 56 tok/s Pro
GPT-5 Medium 34 tok/s Pro
GPT-5 High 31 tok/s Pro
GPT-4o 84 tok/s Pro
Kimi K2 184 tok/s Pro
GPT OSS 120B 441 tok/s Pro
Claude Sonnet 4.5 33 tok/s Pro
2000 character limit reached

Ensemble Active Learning by Contextual Bandits for AI Incubation in Manufacturing (2310.06306v2)

Published 10 Oct 2023 in cs.LG and stat.ML

Abstract: It is challenging but important to save annotation efforts in streaming data acquisition to maintain data quality for supervised learning base learners. We propose an ensemble active learning method to actively acquire samples for annotation by contextual bandits, which is will enforce the exploration-exploitation balance and leading to improved AI modeling performance.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (80)
  1. Leveraging active learning and conditional mutual information to minimize data annotation in human activity recognition. Proceedings of the ACM on Interactive, Mobile, Wearable and Ubiquitous Technologies, 3(3):1–23, 2019.
  2. Digital twin-driven supervised machine learning for the development of artificial intelligence applications in manufacturing. International Journal of Computer Integrated Manufacturing, 33(5):429–439, 2020.
  3. The nonstochastic multiarmed bandit problem. SIAM journal on computing, 32(1):48–77, 2002.
  4. Bandits with knapsacks. Journal of the ACM (JACM), 65(3):1–55, 2018.
  5. Margin based active learning. In International Conference on Computational Learning Theory, pages 35–50. Springer, 2007.
  6. Online choice of active learning algorithms. Journal of Machine Learning Research, 5(Mar):255–291, 2004.
  7. An online sparse estimation-based classification approach for real-time monitoring in advanced manufacturing processes from heterogeneous sensor data. IIE Transactions, 48(7):579–598, 2016.
  8. Stochastic multi-armed-bandit problem with non-stationary rewards. Advances in neural information processing systems, 27:199–207, 2014.
  9. Importance weighted active learning. In Proceedings of the 26th annual international conference on machine learning, pages 49–56, 2009.
  10. Contextual bandit algorithms with supervised learning guarantees. In Proceedings of the Fourteenth International Conference on Artificial Intelligence and Statistics, pages 19–26. JMLR Workshop and Conference Proceedings, 2011.
  11. Exploration vs. exploitation in active learning: A bayesian approach. In The 2010 International Joint Conference on Neural Networks (IJCNN), pages 1–7. IEEE, 2010.
  12. A survey of predictive modeling on imbalanced domains. ACM Computing Surveys (CSUR), 49(2):1–50, 2016.
  13. Maximizing expected model change for active learning in regression. In 2013 IEEE 13th International Conference on Data Mining, pages 51–60. IEEE, 2013.
  14. Nicholas Carlini. Poisoning the unlabeled dataset of semi-supervised learning. arXiv preprint arXiv:2105.01622, 2021.
  15. Tensorflow data validation: Data analysis and validation in continuous ml pipelines. In Proceedings of the 2020 ACM SIGMOD International Conference on Management of Data, pages 2793–2796, 2020.
  16. Bayesian online classifiers for text classification and filtering. In Proceedings of the 25th annual international ACM SIGIR conference on Research and Development in Information Retrieval, pages 97–104, 2002.
  17. Xiaoyu Chen. Multiscale Quantitative Analytics of Human Visual Searching Tasks. PhD thesis, Virginia Tech, Jul 2021. URL http://hdl.handle.net/10919/104200.
  18. Variation analysis and visualization of manufacturing processes via augmented reality. In presented at the IIE Annual Conference. Proceedings, 2016.
  19. Inn: An interpretable neural network for ai incubation in manufacturing. ACM Transactions on Intelligent Systems and Technology (TIST), 13(5):1–23, 2022.
  20. Industrial cyberphysical systems: A backbone of the fourth industrial revolution. IEEE Industrial Electronics Magazine, 11(1):6–16, 2017.
  21. Exact distribution for the product of two correlated gaussian random variables. IEEE Signal Processing Letters, 23(11):1662–1666, 2016.
  22. A smart data annotation tool for multi-sensor activity recognition. In 2017 IEEE International Conference on Pervasive Computing and Communications Workshops (PerCom Workshops), pages 111–116. IEEE, 2017.
  23. Ralf: A reinforced active learning formulation for object class recognition. In 2012 IEEE Conference on Computer Vision and Pattern Recognition, pages 3626–3633. IEEE, 2012.
  24. A novel active learning regression framework for balancing the exploration-exploitation trade-off. Entropy, 21(7):651, 2019.
  25. Confidence interval estimation by bootstrap method for uncertainty quantification using random sampling method. Journal of Nuclear Science and Technology, 52(7-8):993–999, 2015.
  26. Ronald Aylmer Fisher. Design of experiments. Br Med J, 1(3923):554–554, 1936.
  27. Maurice Fréchet. Sur la loi de probabilité de l’écart maximum. Ann. Soc. Math. Polon., 6:93–116, 1927.
  28. A survey on instance selection for active learning. Knowledge and information systems, 35(2):249–283, 2013.
  29. Application of supervised machine learning for defect detection during metallic powder bed fusion additive manufacturing using high resolution imaging. Additive Manufacturing, 21:517–528, 2018.
  30. Steve R Gunn et al. Support vector machines for classification and regression. ISIS technical report, 14(1):5–16, 1998.
  31. Data quality for machine learning tasks. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, pages 4040–4041, 2021.
  32. I. Guyon. Design of experiments for the nips 2003 variable selection benchmark. 2003.
  33. Fault detection using the k-nearest neighbor rule for semiconductor manufacturing processes. IEEE transactions on semiconductor manufacturing, 20(4):345–354, 2007.
  34. Active learning by learning. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 29, 2015.
  35. Detecting cognitive hacking in visual inspection with physiological measurements. Applied ergonomics, 84:103022, 2020.
  36. J Stuart Hunter. The exponentially weighted moving average. Journal of quality technology, 18(4):203–210, 1986.
  37. High density-focused uncertainty sampling for active learning over evolving stream data. In Proceedings of the 3rd International Workshop on Big Data, Streams and Heterogeneous Source Mining: Algorithms, Systems, Programming Models and Applications, pages 133–148. PMLR, 2014.
  38. Ensemble modeling for data fusion in manufacturing process scale-up. IIE Transactions, 47(3):203–214, 2015.
  39. Sequential measurement strategy for wafer geometric profile estimation. Iie transactions, 44(1):1–12, 2012.
  40. Dynamic quality-process model in consideration of equipment degradation. Journal of Quality Technology, 51(3):217–229, 2019.
  41. Computer aided design of experiments. Technometrics, 11(1):137–148, 1969.
  42. Logistic regression. Springer, 2002.
  43. Semi-supervised regression: A recent review. Journal of Intelligent & Fuzzy Systems, 35(2):1483–1500, 2018.
  44. Chen Quin Lam. Sequential adaptive designs in computer experiments for response surface model fit. PhD thesis, The Ohio State University, 2008.
  45. A sequential algorithm for training text classifiers. In SIGIR’94, pages 3–12. Springer, 1994.
  46. Integration of physically-based and data-driven approaches for thermal field prediction in additive manufacturing. Materials & Design, 139:473–485, 2018.
  47. Cluster-based data filtering for manufacturing big data systems. Journal of Quality Technology, pages 1–13, 2021.
  48. Stratified sampling meets machine learning. In International conference on machine learning, pages 2320–2329. PMLR, 2016.
  49. Stream-based active unusual event detection. In Asian Conference on Computer Vision, pages 161–175. Springer, 2010.
  50. Stream-based joint exploration-exploitation active learning. In 2012 IEEE Conference on Computer Vision and Pattern Recognition, pages 1560–1567. IEEE, 2012.
  51. Quadratic forms in random variables: theory and applications. Dekker, 1992.
  52. Balancing exploration and exploitation: A new algorithm for active machine learning. In Fifth IEEE International Conference on Data Mining (ICDM’05), pages 8–pp. IEEE, 2005.
  53. Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12:2825–2830, 2011.
  54. John Platt et al. Probabilistic outputs for support vector machines and comparisons to regularized likelihood methods. Advances in large margin classifiers, 10(3):61–74, 1999.
  55. Online real-time quality monitoring in additive manufacturing processes using heterogeneous sensors. Journal of Manufacturing Science and Engineering, 137(6), 2015.
  56. Burr Settles. Active learning. Synthesis lectures on artificial intelligence and machine learning, 6(1):1–114, 2012.
  57. Query by committee. In Proceedings of the fifth annual workshop on Computational learning theory, pages 287–294, 1992.
  58. Fully-sequential space-filling design algorithms for computer experiments. Journal of Quality Technology, 53(2):173–196, 2021.
  59. A human-centered wearable sensing platform with intelligent automated data annotation capabilities. In Proceedings of the International Conference on Internet of Things Design and Implementation, pages 255–260, 2019.
  60. Constrained maximin designs for computer experiments. Technometrics, 45(4):340–346, 2003.
  61. Eliza Strickland. Andrew Ng: Unbiggen AI, 2022. https://spectrum.ieee.org/andrew-ng-data-centric-ai [Accessed: 2022-02-18].
  62. Logistic regression for crystal growth process modeling through hierarchical nonnegative garrote-based variable selection. IIE Transactions, 48(8):787–796, 2016.
  63. Quality modeling of printed electronics in aerosol jet printing based on microscopic images. Journal of Manufacturing Science and Engineering, 139(7), 2017.
  64. Sebastian Thrun. Exploration in active learning. Handbook of Brain Science and Neural Networks, pages 381–384, 1995.
  65. Learning to learn. Springer Science & Business Media, 2012.
  66. Jan E Trost. Statistically nonrepresentative stratified sampling: A sampling technique for qualitative studies. Qualitative sociology, 9(1):54–57, 1986.
  67. Better data labelling with emblem (and how that impacts defect prediction). IEEE Transactions on Software Engineering, 2020.
  68. Large-scale online multitask learning and decision making for flexible manufacturing. IEEE Transactions on Industrial Informatics, 12(6):2139–2147, 2016.
  69. Online computation performance analysis for distributed machine learning pipelines in fog manufacturing. In 2020 IEEE 16th International Conference on Automation Science and Engineering (CASE), pages 1628–1633. IEEE, 2020.
  70. Pyramid ensemble convolutional neural network for virtual computed tomography image prediction in a selective laser melting process. Journal of Manufacturing Science and Engineering, 143(12):121003, 2021a.
  71. Moss—multi-modal best subset modeling in smart manufacturing. Sensors, 21(1):243, 2021b.
  72. Active learning in multimedia annotation and retrieval: A survey. ACM Transactions on Intelligent Systems and Technology (TIST), 2(2):1–21, 2011.
  73. Learning with uncertainty. CRC Press, 2016.
  74. Ral-improving stream-based active learning by reinforcement learning. In European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD) Workshop on Interactive Adaptive Learning (IAL), 2019.
  75. Recent advances in prognostics and health management for advanced manufacturing paradigms. Reliability Engineering & System Safety, 178:255–268, 2018.
  76. Akm2d: An adaptive framework for online sensing and anomaly quantification. IISE Transactions, 52(9):1032–1046, 2020.
  77. Fog computing for distributed family learning in cyber-manufacturing modeling. In 2019 IEEE International Conference on Industrial Cyber Physical Systems (ICPS), pages 88–93. IEEE, 2019.
  78. Oui! outlier interpretation on multi-dimensional data via visual analytics. In Computer Graphics Forum, volume 38, pages 213–224. Wiley Online Library, 2019.
  79. Introduction to semi-supervised learning. Synthesis lectures on artificial intelligence and machine learning, 3(1):1–130, 2009.
  80. Martin Zinkevich. Online convex programming and generalized infinitesimal gradient ascent. In Proceedings of the 20th international conference on machine learning (icml-03), pages 928–936, 2003.
Citations (3)

Summary

We haven't generated a summary for this paper yet.

Lightbulb Streamline Icon: https://streamlinehq.com

Continue Learning

We haven't generated follow-up questions for this paper yet.

List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.