Custom Classifiers

Custom classifiers are text classifiers that you can fine-tune to classify whatever you want. They show up the same as all other classifiers in your feeds.

You create them by clicking New Classifier in the Classifiers section.

Creating a Custom Classifier

Methods for creating a custom classifier

You have three methods you can choose from for creating custom classifier: Create an ensemble, Tag similar sentences, or Train a model.

Create an ensemble

If you don't have the time to train a classifier, start here. Ensembles in Caravel are the easiest way to create a custom classifier because it requires zero training.

In Machine Learning, ensembles are a combination of models that serve as a single model to get better prediction results than what one model can provide. In Caravel, ensembles operate similarly but allow you to use more than just models.

Using ensembles in Caravel you can combine zero-shot classification, keyword search, and prebuilt models to classify your text.

They are incredibly flexible and very quick to get started.

Tag similar sentences

If you have a need to provide feedback to your classifier to continuously improve its accuracy start here.

With the tagging method, you can tag sentences from your sources and train your classifier to predict similar statements. You can start with a few sentences and increase the confidence threshold of these classifiers as you add more samples to make them more precise.

Train a model

If you have a complex topic you need to classify that is more general in nature, perhaps it's a better fit for a traditional model.

At the time of this writing, model training is accessible by request only.

Custom Classifier Concepts

Confidence

In all classifier you have control over the confidence value. Confidence is a threshold for a classifier when predicting your label. The higher the value, the more scrutiny your classifier will apply while prediction mentions of your labels.

With trainable classifiers, like ones that use the Tagging method, you can increase the confidence as you add samples to improve the classifier's precision with more data and you can lower the confidence to get predictions with less samples.

Natural Language Definition

When using the Ensemble method you have the option to input a Natural language definition. This enables you to use Zero-Shot Classification with your ensemble. Think of this a practical name for your label that Zero-Shot Classification can understand to predict mentions of your label. To learn more about Zero-Shot Classification and how it works, see here.

Sentences

When cleaning and parsing your sources of text for classification, Caravel automatically breaks your text into sentences. Each sentence is then passed into your classifiers to be labeled.

When providing samples to train your classifiers, you can tag sentences using our Highlighting UI. This enables you to quickly provide many samples to your classifiers. It also enables Caravel to identify multiple labels from the same classifier within a single message.