https://github.com/paritoshk/anysphere_test

"speculative edits" using pytorch and huggingface with temperature 0 (greedy sampling)
https://github.com/paritoshk/anysphere_test

Last synced: over 1 year ago
JSON representation

"speculative edits" using pytorch and huggingface with temperature 0 (greedy sampling)

Host: GitHub
URL: https://github.com/paritoshk/anysphere_test
Owner: paritoshk
Created: 2024-09-18T02:26:30.000Z (almost 2 years ago)
Default Branch: master
Last Pushed: 2024-09-18T04:04:15.000Z (almost 2 years ago)
Last Synced: 2025-01-29T17:11:37.635Z (over 1 year ago)
Language: Python
Size: 22.5 KB
Stars: 1
Watchers: 1
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md

Awesome Lists containing this project

README

          # Speculative Edits

## Problem

Your goal is to implement "speculative edits" using pytorch and huggingface with temperature 0 (greedy sampling)

We describe speculative edits in brief detail near the end of [this blog post](https://cursor.sh/blog/instant-apply).

To summarize: 

Instead of a draft model producing draft tokens (as in done in speculative edits), we manually produce the draft tokens ourselves since we have a strong prior on generated tokens for sparse edits represented as rewrites.

Consider the following prompt:

````txt

Please add a single comment

```ts

export default function Visualization() {

  const [instanceIdInputs, setInstanceIdInputs] = createSignal<

    InstanceId[] | null

  >(null);

  const [storedInput, setStoredInput] = createSignal("");

  const [datapointOptions, setDatapointOptions] = createSignal(

    []

  );

  const [shouldRefreshGold, setShouldRefreshGold] =

    createSignal(false);

  const [showGold, setShowGold] = createSignal(false);

  const [selectedGoldRequestId, setSelectedGoldRequestId] = createSignal<

    string | undefined

  >(undefined);

  const [goldInstances, setGoldInstances] = createSignal<

    {

      sessionId: string;

      email: string | undefined;

      requestId: string | undefined;

      dateAdded: Date;

      type: $Enums.CppGoldExampleType;

    }[]

  >([]);

}

```

```ts

````

You should be able to generate this code much faster than vanilla token generation

with speculative edits. Why?

A sample generated response is:

````

export default function Visualization() {

  const [instanceIdInputs, setInstanceIdInputs] = createSignal<

    InstanceId[] | null

  >(null);

  const [storedInput, setStoredInput] = createSignal("");

  const [datapointOptions, setDatapointOptions] = createSignal(

    []

  );

  const [shouldRefreshGold, setShouldRefreshGold] =

    createSignal(false);

  const [showGold, setShowGold] = createSignal(false);

  const [selectedGoldRequestId, setSelectedGoldRequestId] = createSignal<

    string | undefined

  >(undefined);

  # This is a comment

  const [goldInstances, setGoldInstances] = createSignal<

    {

      sessionId: string;

      email: string | undefined;

      requestId: string | undefined;

      dateAdded: Date;

      type: $Enums.CppGoldExampleType;

    }[]

  >([]);

}

```

````

On the first forward pass, we can feed in the entire original code block as a speculation. Then we start generating tokens when the model disagrees with our draft. (at the line `# this is a comment`). Eventually, we'll want to re-speculate on the remainder of the prompt.

Note that the choice of greedy sampling simplifies things.

If developing locally, we'd recommend using a small model like `gpt-2`.

If you would like to test real performance on smarter models before submission, you may use the card provided in the form to purchase $10 worth of GPU-hours. We'd recommend 3090s or 4090s on [Runpod](https://runpod.io), but any machine with >18GB of GPU RAM will suffice.

A good model to test would be `llama-3-8b-instruct` or `deepseek-coder-6.7b-instruct`.

## Submission Instructions

When finished, please zip this folder and upload the zipfile to this form: https://docs.google.com/forms/d/1COjBZBA5jSxpaLtBZlD-ktAS0LnWbiH4LfyUgzowgpg/edit

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/paritoshk/anysphere_test

Awesome Lists containing this project

README