Skip to content

MVA - Kernek Methods in Machine Learning - Data Challenge

Notifications You must be signed in to change notification settings

VictorSanh/Kernel_Challenge

Repository files navigation

Kernel_Challenge

MVA - Kernek Methods in Machine Learning - Data Challenge


How to install the requirements ?

pip3 install -r requirements.txt

PACKAGES - NOTES

## How to use k-fold cross-validation

  • If you are using a kernelMethod class (for instance kernelSVM or kernelKNN), just use method assess(): Ex: kNN.assess(data, labels, n_folds, stringsData)

    • data and labels are given as matrices (tr0['Sequence'].as_matrix() will do the trick)
    • n_folds is the number of folds for cross-valisation
    • stringsData is True if you are providing the original text data, False if you are passing transformed data that lives in a Euclidian space
  • If you are not using a kernelMethod class, use fonction kfold() from kernel_methods.py: Ex: kfold(data, labels, n_folds, train_method, predict_method, classify_method, labels_formatting, metric, target_folds, verbose, **kwargs)

    • train_method is what will be used to train your model from train data (arg1) and train labels (arg2). It should be a method from your classifier that modifies learned parameters stored as classifier attributes. **kwargs will be passed to this method

    • pred_method is the method for generating prediction from validation data (arg1). **kwargs will be passed to this method. The method should generate raw predictions, for instance 0.52 or -0.21

    • classify_method is the method that takes a raw prediction and returns the classified prediction, for instance 0.52-->1 or -0.21-->-1 for SVM

    • labels_formatting is useful for classifiers that need to have a given labels format. For intance, SVM requires labels to be in {-1,+1}. If your method requires no formatting, just pass the identity fct

    • metric will measure the performance by comparing labels to predictions. For instance, a m_binary from metrics.py

    • target_folds is onnly used by grid_search function, just ignore this parameter

About

MVA - Kernek Methods in Machine Learning - Data Challenge

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published