From 644e5c7e46c0d65dc4b402b0264c9155a91ab1b4 Mon Sep 17 00:00:00 2001
From: Joshua David
Date: Wed, 17 Jul 2024 23:00:05 -0700
Subject: [PATCH] Update README.md to be more detailed

---
 README.md | 5 +++++
 1 file changed, 5 insertions(+)

diff --git a/README.md b/README.md
index e0c898b..a8e53c7 100644
--- a/README.md
+++ b/README.md
@@ -194,6 +194,11 @@ print(output.shape) # Expected shape: (batch_size, target_length, d_model)
 
 My implementation of LongRoPE achieves the following results:
 
+1. Perplexity:
+   - 4k context: X.XX
+   - 128k context: X.XX
+   - 2048k context: X.XX
+
 ## Citation