Skip to content

Latest commit

 

History

History
347 lines (242 loc) · 20.3 KB

AL_applications.md

File metadata and controls

347 lines (242 loc) · 20.3 KB

AL Applications

In this chapter, we will divide the works into two sections: scientific applications and industrial applications.

(Note that the works list in this pages are works I browsed in background reading. There might be more important works in the corresponding fields not listed here. Besides, I can't ensure the works are representative in the fields. So if you have any comments and recommendations, pls let me know.)

Scientific Applications (alphabetical order)

The scientific applications of AL are about biology, chemistry, physics or things about experiment design and analysis.

Astronomy

Spectroscopic Surveys

Reduce Simulation Costs

  • Active Learning for Computationally Efficient Distribution of Binary Evolution Simulations [2022]

Biology

Structural Biology:

  • Active machine learning for transmembrane helix prediction [2010, BMC Bioinform]
  • Investigating Active Learning and Meta-Learning for Iterative Peptide Design [2020, JCM]
  • Large scale active-learning-guided exploration for in vitro protein production optimization [2020, Nature Communications]
  • Active learning to classify macromolecular structures in situ for less supervision in cryo-electron tomography [Bioinformatics]

Cell Classification:

  • Active feature selection discovers minimal gene-sets for classifying cell-types and disease states in single-cell mRNA-seq data [2021]

Molecular Property Prediction:

Chemistry

Chemical natural language processing

  • Active Learning Yields Better Training Data for Scientific Named Entity Recognition [2020]

Experiment Design/Experimental Condition Selection

  • Active machine learning-driven experimentation to determine compound effects on protein patterns [2016, Elife]
  • Categorical Matrix Completion with Active Learning for High-throughput Screening [2020, IEEE Transactions on Computational Biology and Bioinformatics]: Categorical matrix completion is designed to accurately impute the missing experiments while margin sampling is also implemented for uncertainty estimation.
  • Active Learning Approach to Optimization of Experimental Control [2020, CHIN. PHYS. LETT.]: Also an optimization problem, where they try to get the optimal control parameters.
  • AI-Assisted Scientific Data Collection with Iterative Human Feedback [2021]
  • Active Learning for the Optimal Design of Multinomial Classification in Physics [2021]

Geology

Lithology Identification:

Materials

Materials Design and Discovery

Math and Statistics

Model checking:

Physics

Condensed Matter Physics

Molecular Dynamics:

  • [Active learning accelerates ab initio molecular dynamics on reactive energy surfaces [2020, Chem]
  • Active Learning in Bayesian Neural Networks for Bandgap Predictions of Novel Van der Waals Heterostructures [2021, Advanced Intelligent Systems]
  • Nanohardness from First Principles with Active Learning on Atomic Environments [2022, JCTC]
  • Batch active learning for accelerating the development of interatomic potentials [2022, Comput. Mater. Sci.]

Transition State Calculation:

  • Active Learning for Transition State Calculation

Quantum Physics

  • Active Learning of Quantum System Hamiltonians yields Query Advantage [2021]

Industrial Applications (alphabetical order)

The industrial applications of AL are about the practical problems or specific requirements in the industry.

Answer Selection

Autonomous Driving

Survey Paper:

  • Knowledge Augmented Machine Learning with Applications in Autonomous Driving: A Survey [2022]

Line detection:

  • Active Learning for Lane Detection: A Knowledge Distillation Approach [2021, ICCV]: Evaluate the uncertainty by the teacher-student models.

Trajectory annotation:

  • Engineering Applications of Artificial Intelligence [2022, Engineering Applications of Artificial Intelligence]

Brain Mapping in a High Performance Computing Environment

About neuron segmentation and tracing at scale.

Communication

Eg. 5g, Traffic Classification

Crowd Counting

Dataset Engineering

Dataset Building/ Data Annotation

  • Lsun: Construction of a large-scale image dataset using deep learning with humans in the loop [2015, Arxiv]
  • A Simple Yet Brisk And Efficient Active Learning Platform For Text Classification
  • Paladin: an annotation tool based on active and proactive learning [2021]
  • Scale AI
  • Appen (Figure-Eight)
  • ClickWorker

Data Enrichment

Design for Crowdworkers

  • Models in the Loop: Aiding Crowdworkers with Generative Annotation Assistants [2021]

Dialog Policy Learning (intelligent system)

Disguised Faces Recognition

Drug Discovery

  • Active-learning strategies in computer-assisted drug discovery. [2015, Drug Discov.]
  • Active learning for computational chemogenomics [2017, Future Medicinal Chemistry]
  • Evaluation of Categorical Matrix Completion Algorithms: Towards Improved Active Learning for Drug Discovery [2021, System Biology]
  • Active Learning for Drug Design: A Case Study on the Plasma Exposure of Orally Administered Drugs [2021, J MED CHEM]

Fault Diagnosis of Machinery

  • Fault Diagnosis of Rotating Machinery With Limited Expert Interaction: A Multicriteria Active Learning Approach Based on Broad Learning System [2022, ]

Gas Reservoir Prediction

Labeling System

This is the most direct industrial application.

Firms:

Papers:

Malware Detection

  • Q-learning and LSTM based deep active learning strategy for malware defense in industrial IoT applications [2021, Multimedia Tools and Applications]

Medical Research

Medical Image Classification / Image Annotation

  • Fine-tuning Convolutional Neural Networks for Biomedical Image Analysis: Actively and Incrementally [CVPR, 2017]: The CNN is fine-tuned in each active learning iteration incrementally. Assume that each candidate (predefined) takes one of possible labels Y. This assumption might make it more difficult to generalize.
  • Semi-Supervised Active Learning for COVID-19 Lung Ultrasound Multi-symptom Classification [2020]: Multi-label classification network (MSML) is proposed, and a human-machine interaction is exploited to confirm the final annotations that are used to fine-tune MSML with progressively labeled data. Still pool based.
  • An Adaptive Low-Rank Modeling-Based Active Learning Method for Medical Image Annotation [2020, IRBM]: Medical images: the intrinsic presence of noise in medical images, the large number of images, and the variety of imaging modalities. Low-rank modeling-based multi-label active learning (LRMMAL) method.
  • A Transfer Learning Based Active Learning Framework for Brain Tumor Classification [2020]
  • An Active Learning Method for Diabetic Retinopathy Classification with Uncertainty Quantification [2020]
  • Deep Reinforcement Active Learning for Medical Image Classification [2020]
  • MedSelect: Selective Labeling for Medical Image Classification Combining Meta-Learning with Deep Reinforcement Learning [2021]
  • Representative Region Based Active Learning For Histological Classification Of Colorectal Cancer [2021, ISBI]
  • Su-Sampling Based Active Learning For Large-Scale Histopathology Image [2021, ICIP]
  • PathAL: An Active Learning Framework for Histopathology Image Analysis [2021, T-MI]
  • DECAL: DEployable Clinical Active Learning [2022, ICML Workshop]
  • Patient Aware Active Learning for Fine-Grained OCT Classification [2022, ICIP]
  • Information Gain Sampling for Active Learning in Medical Image Classification [2022]
  • Graph Node Based Interpretability Guided Sample Selection for Active Learning [2022, TMI]

Medical Image Segmentation

Medical Symptom Recognition from Text

  • Medical symptom recognition from patient text: An active learning approach for long-tailed multilabel distributions [2020, ML4H]

ECG Classification

Retinal Image Analysis

  • Human-in-the-loop for efficient training of retinal image analysis methods [2021, Master's Thesis]

Biomedical Knowledge Base Construction

  • BioAct: Biomedical Knowledge Base Construction using Active Learning [2022]

Mental Disorder Therapy

  • Explainable Deep Attention Active Learning for Sentimental Analytics of Mental Disorder [2022, TALLIP]

Mobile Health Monitoring / Disease detection

Person Re-identification

Privacy Policy Classification

  • Deep Active Learning with Crowdsourcing Data for Privacy Policy Classification [2020, Arxiv]

Questionnaire

  • Vexation-Aware Active Learning for On-Menu Restaurant Dish Availability [2022, KDD]

Remote Sensing

Remote sensing is an field with really high labeling cost. AL is well used in this field.

Several surveys of AL for remote sensing:

  • A survey of active learning algorithms for supervised remote sensing image classification
  • A Survey of Active Learning for Quantifying Vegetation Traits from Terrestrial Earth Observation Data [2021]

Papers:

Semiconductor Manufacturing

  • Domain-adaptive active learning for cost-effective virtual metrology modeling [2021, Computers in Industry]

Simulation

Air Traffic Management (ATM) modeling:

  • Active Learning Metamodels for ATM Simulation Modeling [2021, SESAR ]

Software Engineering

API Misuse Detection

Solvability Prediction in Power Systems

Social Bots Detection

  • A novel framework for detecting social bots with deep neural networks and active learning [2021, KBS]

Spam Detection