IRISLIB database
VectorSpace Class Reference
Inheritance diagram for VectorSpace:
Collaboration diagram for VectorSpace:

Public Attributes

 CategoryGlobalTermWeights
 The corpus-wide relevancy factor to take into account when calculating term weights. More...
 
 CategoryLocalTermMetric
 The metric to use for calculating the local term weights. More...
 
 CategoryLocalTermWeights
 The per-category term weight factor. More...
 
 CategoryNormalization
 Whether and how to normalize the category vectors. More...
 
 CustomTermWeights
   More...
 
- Public Attributes inherited from Base
 Builder
   More...
 

Private Member Functions

_.Library.Status OnRemoveTerm (_.Library.Integer pIndex)
 Callback invoked whenever an entire term at an index has been removed.
 
_.Library.Status SetCustomWeight (_.Library.Integer pIndex, _.Library.String pCategory, _.Library.Double pCustomWeight)
 Sets a custom weight factor for the term at pIndex in pCategory.
 

Additional Inherited Members

- Public Member Functions inherited from RegisteredObject
_.Library.Status OnAddToSaveSet (_.Library.Integer depth, _.Library.Integer insert, _.Library.Integer callcount)
 This callback method is invoked when the current object is added to the SaveSet,. More...
 
_.Library.Status OnClose ()
 This callback method is invoked by the <METHOD>Close</METHOD> method to. More...
 
_.Library.Status OnConstructClone (_.Library.RegisteredObject object, _.Library.Boolean deep, _.Library.String cloned)
 This callback method is invoked by the <METHOD>ConstructClone</METHOD> method to. More...
 
_.Library.Status OnNew ()
 This callback method is invoked by the <METHOD>New</METHOD> method to. More...
 
_.Library.Status OnValidateObject ()
 This callback method is invoked by the <METHOD>ValidateObject</METHOD> method to. More...
 
- Static Public Attributes inherited from RegisteredObject
 CAPTION = None
 Optional name used by the Form Wizard for a class when generating forms. More...
 
 JAVATYPE = None
 The Java type to be used when exported.
 
 PROPERTYVALIDATION = None
 This parameter controls the default validation behavior for the object. More...
 

Detailed Description

This Builder Method implementation generates category term weights as a vector per category that exists in the same vector space as the document term vector. The similarity between the document vector and each of the category vectors can then be calculated using euclidean distance or cosine similarity (angle). Alternatively, these weights can be used for a linear regression formula, calculating a score rather than a similarity/distance.

A combination of global term weights (across the corpus), local term weights (within each category) and normalization (per category) is used to calculate these category vectors.

Member Data Documentation

◆ CategoryGlobalTermWeights

CategoryGlobalTermWeights

The corpus-wide relevancy factor to take into account when calculating term weights.

 

◆ CategoryLocalTermMetric

CategoryLocalTermMetric

The metric to use for calculating the local term weights.

 

◆ CategoryLocalTermWeights

CategoryLocalTermWeights

The per-category term weight factor.

 

◆ CategoryNormalization

CategoryNormalization

Whether and how to normalize the category vectors.

 

◆ CustomTermWeights

CustomTermWeights