https://github.com/dluman/char-rnn

A basic, single-layer character-level RNN based on Andrej Karpathy's "vanilla" minimal RNN. I am looking for suggestions for improvements!
https://github.com/dluman/char-rnn

Last synced: 3 months ago
JSON representation

A basic, single-layer character-level RNN based on Andrej Karpathy's "vanilla" minimal RNN. I am looking for suggestions for improvements!

Host: GitHub
URL: https://github.com/dluman/char-rnn
Owner: dluman
License: unlicense
Created: 2017-06-07T15:20:26.000Z (about 8 years ago)
Default Branch: master
Last Pushed: 2017-06-21T17:20:06.000Z (almost 8 years ago)
Last Synced: 2025-01-17T05:41:42.120Z (5 months ago)
Language: Python
Homepage:
Size: 4.47 MB
Stars: 0
Watchers: 2
Forks: 0
Open Issues: 0
Metadata Files:
- Readme: README.md
- License: LICENSE

Awesome Lists containing this project

README

# char-rnn
A basic, single-layer character-level RNN based on Andrej Karpathy's "vanilla" minimal RNN. This is largely a class-based transcription of the 100-line gist provided here: https://gist.github.com/karpathy/d4dee566867f8291f086.

I am very intersted in making this code much better, even adapting it to a mutliple-layer RNN. While I will eventually have to go to a framework (like Tensorflow), I am interested in working framework-free for a while.

## Intention

I want to write a RNN that studies style and writes stylistically like a given source text. Right now, the routine hits a hard loss limit of about 39.0000, even after millions of iterations. This code is currently run on a VPS sans-GPU. As I wrote above, I am absolutely open to suggestions of how I can improve this up to the limit of needing a framework.

ecosyste.ms

Data

Tools

Indexes

Applications

Experiments

Awesome

https://github.com/dluman/char-rnn

Awesome Lists containing this project

README