Skip to content

PassTrough Embeddings #132

@EpicUsaMan

Description

@EpicUsaMan

in some tasks we need deep contextual embedding from LLM

Ways to include:

  • PCA over embeddings on preprocessing stage (unsupervised, linear)
  • LDA over embeddings on preprocessing stage (supervised, linear)
  • UMAP over embeddings on preprocessing stage (unsupervised, non-linear)
  • VectorNormalization over embeddings on preprocessing stage (unsupervised, linear)
    -> np.linalg.norm(embd) and sum over it (in practice we can generate two features out of it, unit vector and magnitude)
  • Raw include
    -> x = concat([self.embeddings(num_features, cat_features), emb_features])
    -> x = self.embeddings(num_features, cat_features) + emb_features

Metadata

Metadata

Assignees

No one assigned

    Labels

    enhancementNew feature or request

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions