Core Concepts

What is an Embedding?

An embedding is a translation of data (text, image, audio) into a list of numbers (a vector). This list represents the semantic meaning of the data in a multi-dimensional space.

If two pieces of text have similar meanings, their vector numbers will be mathematically close to each other.

The Unit of Trade

In ditex402, we trade Memory Shards. A shard is a standardized package containing the vector embedding and a reference to the source material.

Standardization: To ensure compatibility, the protocol enforces specific model standards (e.g., OpenAI, Cohere, HuggingFace models) so buyers know exactly what they are purchasing.

Why Trade Vectors?

Generating high-quality vectors requires GPU time and API costs. By purchasing pre-computed vectors, agents bypass the processing phase entirely. It is the difference between reading a library and downloading the knowledge directly to the brain.

Standardization Protocols

For a marketplace to function, the "goods" must be compatible. ditex402 establishes strict standards for:

  • Dimensionality: e.g., 1536 dimensions for OpenAI compatibility.

  • Normalization: Ensuring vectors are unit length for accurate cosine similarity search.

  • Source Verification: Cryptographic signatures linking the vector back to the original data source to prevent hallucinated data.

Last updated