Plot logits and ground truth like a spectrogram #5
Comments
At the moment the plotting function seems to be partly broken. The number of xticks matches what is specified in the code, but the total duration shown on the time axis differs considerably from the length of the processed song. Moreover, either the inference or the loss function appears to be wrong, because the logits are very similar for every string: currently every string outputs the full polyphonic recognition instead of its own monophonic pitch.
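One way to check the duration mismatch is to derive the tick labels directly from the frame count, hop size, and sample rate, so that the last tick always corresponds to the last frame. The sketch below is only an illustration: `time_ticks` is a hypothetical helper that is not part of the repository, and the hop size (80 samples) and sample rate (8000 Hz) are assumed values.

```python
def time_ticks(num_frames, hopsize, sample_rate, num_ticks=5):
    """Return tick positions (in frames), tick labels (in seconds) and duration.

    Hypothetical helper: the labels are computed from the frame index, so the
    axis cannot drift away from the real length of the processed audio.
    """
    if num_frames < 1 or num_ticks < 2:
        raise ValueError("need at least one frame and two ticks")
    seconds_per_frame = hopsize / sample_rate
    duration = num_frames * seconds_per_frame
    last_frame = num_frames - 1
    # Evenly spaced positions from the first to the last frame
    positions = [i * last_frame / (num_ticks - 1) for i in range(num_ticks)]
    # Convert each frame position to seconds for the label
    labels = [f"{p * seconds_per_frame:.2f}" for p in positions]
    return positions, labels, duration


# Assumed values: 1001 frames, hop of 80 samples, 8000 Hz audio
positions, labels, duration = time_ticks(num_frames=1001, hopsize=80, sample_rate=8000)
```

The returned positions and labels can then be passed to matplotlib's `Axes.set_xticks` and `Axes.set_xticklabels`. Comparing `duration` with the length of the audio file in seconds would show whether the mismatch comes from the ticks or from the number of frames the model produced.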
Alright, the |
* fix yticks and xticks
* fix the fontsize
* change the aspect ratio

related to: #5
Export of the figure is now much more suited for the polyphonic logits output. What's missing is the ground truth plot on top of the logits or at least next to them. Plots compatible with |
First the |
Make sure that the logits are plotted nicely with the ground truth for the one string model and the original |
This is now important again. Logit plots with ground truth on top work well for the test set, but they are not yet available for audio files provided with labels, nor for monophonic audio files with polyphonic labels. What we need is a function that can take: The inside function has to take:
Tweak the existing |
Make a story with the plots. First show a plot of the raw pitch output, then the pitch output with the periodicity values drawn on top of it, and finally the pitch filtered with different periodicity thresholds.
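The last step of that story could be sketched as follows. This is a minimal illustration, not the repository's decoding code: `threshold_pitch` is a hypothetical helper, the pitch and periodicity values are made up, and frames below the threshold are set to NaN so that a line plot would leave gaps there.

```python
import numpy as np


def threshold_pitch(pitch, periodicity, threshold):
    """Keep pitch only where periodicity reaches the threshold (hypothetical helper)."""
    pitch = np.asarray(pitch, dtype=float)
    periodicity = np.asarray(periodicity, dtype=float)
    # Frames below the threshold are treated as unvoiced and become NaN
    return np.where(periodicity >= threshold, pitch, np.nan)


# Made-up example data: pitch in Hz and periodicity in [0, 1] per frame
pitch = [110.0, 220.0, 330.0, 440.0]
periodicity = [0.05, 0.3, 0.6, 0.95]

# Count how many frames survive each threshold
voiced_counts = {}
for threshold in (0.1, 0.5, 0.9):
    filtered = threshold_pitch(pitch, periodicity, threshold)
    voiced_counts[threshold] = int(np.count_nonzero(~np.isnan(filtered)))
```

Plotting `filtered` for each threshold next to the raw pitch would give the three stages described above.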
Fixes and plots:
Describe how the model was trained for the multi-pitch strings.
Get a better understanding of the decoding with periodicity thresholding, and check whether softmax is applied correctly.
Logits make much more sense when visualized after a sigmoid. The unnormalized logits are spread out and not that clear.
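To illustrate why the sigmoid helps, the sketch below maps made-up raw logits to the range 0 to 1 before plotting. It is only an assumption about how the normalization could be done for visualization. It uses the identity sigmoid(x) = 0.5 * (1 + tanh(x / 2)), which avoids overflow for large negative values.

```python
import numpy as np


def sigmoid(x):
    """Numerically stable sigmoid, using sigmoid(x) = 0.5 * (1 + tanh(x / 2))."""
    x = np.asarray(x, dtype=float)
    return 0.5 * (1.0 + np.tanh(0.5 * x))


# Made-up logits with shape (pitch_bins, frames); real values would come from the model
logits = np.array([[-12.0, -3.0, 0.0],
                   [4.0, 9.0, -800.0]])

# After the sigmoid every value lies in [0, 1], so one color scale fits all plots
probs = sigmoid(logits)
```

Because each bin is squashed independently, several bins can be close to 1 in the same frame. That suits a polyphonic output, whereas a softmax would force the bins of a frame to sum to 1.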
Example command for plotting with the FCN model:
python -m penn.plot.to_latex --config config/polypennfcn-15ks-batch.py --checkpoint runs/polypennfcn-15ks-batch/00005000.pt --audio_file data/cache/gset/000121.wav -m -l --ground_truth_file data/cache/gset/000121-pitch.npy -m -l