1 min readfrom Machine Learning

[D] Is this considered unsupervised or semi-supervised learning in anomaly detection?

Hi 👋🏼, I’m working on an anomaly detection setup and I’m a bit unsure how to correctly describe it from a learning perspective.

The model is trained using only one class of data (normal/benign), without using any labels during training. In other words, the learning phase is based entirely on modelling normal behaviour rather than distinguishing between classes.

At evaluation time, I select a decision threshold on a validation set by choosing the value that maximizes the F1-score.

So the representation learning itself is unsupervised (or one-class), but the final decision boundary is chosen using labeled validation data.

I’ve seen different terminology used for similar setups. Some sources refer to this as semi-supervised, while others describe it as unsupervised anomaly detection with threshold calibration.

What would be the most accurate way to describe this setting in a paper without overclaiming?

submitted by /u/Opening-Rich-4425
[link] [comments]

Want to read more?

Check out the full article on the original site

View original article

Tagged with

#automated anomaly detection
#machine learning in spreadsheet applications
#generative AI for data analysis
#Excel alternatives for data analysis
#real-time data collaboration
#big data management in spreadsheets
#conversational data analysis
#intelligent data visualization
#data visualization tools
#enterprise data management
#big data performance
#data analysis tools
#data cleaning solutions
#rows.com
#natural language processing for spreadsheets
#cloud-based spreadsheet applications
#financial modeling with spreadsheets
#real-time collaboration
#anomaly detection
#unsupervised learning