Mastra • 2mo ago
Saulved

Can we parallelize semantic recall at all

Hey team, we noticed that when we turn on semantic recall, the time it takes for a response to come in client-side increases by ~80%. Can we parallelize this in any way to speed it up? Here is the setup:
import { Memory } from "@mastra/memory";
import { PgVector, PostgresStore } from "@mastra/pg";

const memory = new Memory({
  storage: new PostgresStore({
    connectionString: process.env.DATABASE_URL!,
    schemaName: "mastra",
  }),
  vector: new PgVector({
    connectionString: process.env.DATABASE_URL!,
    schemaName: "mastra",
  }),
  embedder: "openai/text-embedding-3-small",
  options: {
    lastMessages: 10,
    workingMemory: {
      enabled: true,
      scope: "resource",
      schema: userContextSchema,
    },
    semanticRecall: {
      topK: 3,
      messageRange: 2,
      indexConfig: {
        type: "hnsw",
        metric: "dotproduct",
      },
    },
  },
});
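For anyone wanting to reproduce the comparison, a minimal sketch is shown below. It simply times the same call with semanticRecall enabled and then disabled; `myAgent`, the prompt, and the thread/resource ids are hypothetical placeholders, and the `generate` memory option names can differ between Mastra versions, so treat it as a shape rather than exact code.

import { performance } from "node:perf_hooks";

// Sketch only: measure wall-clock latency of a single agent call.
// `myAgent` is assumed to be an Agent configured with the Memory instance above.
async function timeGenerate(label: string): Promise<void> {
  const start = performance.now();
  await myAgent.generate("What did we discuss about pricing last week?", {
    // Placeholder ids; older Mastra versions pass threadId/resourceId instead.
    memory: { thread: "thread-123", resource: "user-456" },
  });
  console.log(`${label}: ${Math.round(performance.now() - start)} ms`);
}

// Run once with semanticRecall in the config, then comment it out and rerun to compare.
await timeGenerate("semantic recall enabled");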
2 Replies
Mastra Triager • 2mo ago
📝 Created GitHub issue: https://github.com/mastra-ai/mastra/issues/9719
_roamin_ • 2mo ago
Hey @Saulved! I'm not really seeing this difference on my end. Could you share a reproducible example with us? Thanks 🙏
