Hierarchical Large Language Model (H-LLM) Exploration

Prerequisites

Before starting, make sure your system meets the following requirements:

Python 3.x installed.
Operating System: Preferably Linux or macOS.
Hardware Requirements: Minimum of four NVIDIA RTX A4500 GPUs, totaling approximately 80 GiB of GPU memory.
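
The hardware requirement can be checked before launching anything. The following is a minimal sketch, not part of the repository: it assumes PyTorch is among the installed packages (the contents of requirements.txt are not shown here), and the helper names `meets_gpu_requirement` and `detect_gpu_memory` are hypothetical. The default threshold is set slightly below 80 GiB because drivers often report a little less than the nominal capacity; that margin is an assumption.

```python
import sys

GIB = 1024 ** 3


def meets_gpu_requirement(mem_bytes_per_gpu, min_gpus=4, min_total_gib=78.0):
    """Return True if there are enough GPUs and enough combined memory.

    min_total_gib defaults to 78 rather than 80 (an assumed margin), since
    the README states the total only as "approximately 80 GiB".
    """
    total_gib = sum(mem_bytes_per_gpu) / GIB
    return len(mem_bytes_per_gpu) >= min_gpus and total_gib >= min_total_gib


def detect_gpu_memory():
    """Return total memory in bytes for each visible CUDA device.

    Returns an empty list if PyTorch is missing or no CUDA device is visible.
    """
    try:
        import torch  # assumed dependency; not confirmed by this README
    except ImportError:
        return []
    if not torch.cuda.is_available():
        return []
    return [
        torch.cuda.get_device_properties(i).total_memory
        for i in range(torch.cuda.device_count())
    ]


if __name__ == "__main__":
    print("Python 3:", sys.version_info.major == 3)
    print("GPU requirement met:", meets_gpu_requirement(detect_gpu_memory()))
```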

Installation

Install the required packages using the following command in your terminal:

pip install -r requirements.txt

Configuration

Setting Paths

Before running anything, update the following variables in each script to match your system:

`llama_main.py`:

base_model_name: Identifier for the Hugging Face model.
new_model_path: Where new, unlearned models are saved.
pretrained_model_name: Where combined models are saved.
data_name: Path to the dataset.
file_path: Where evaluation outputs are logged.
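
One way to keep these five settings together and catch a wrong path early is sketched below. It is only an illustration: the class `LlamaMainPaths` and its `prepare` method are hypothetical and do not exist in `llama_main.py`. It also assumes that `new_model_path` and `pretrained_model_name` are directories, that `file_path` is a log file, and that `data_name` is a local file.

```python
from dataclasses import dataclass
from pathlib import Path


@dataclass
class LlamaMainPaths:
    """Hypothetical container for the variables listed for llama_main.py."""

    base_model_name: str        # Hugging Face identifier, not a local path
    new_model_path: str         # assumed to be a directory
    pretrained_model_name: str  # assumed to be a directory
    data_name: str              # assumed to be a local dataset file
    file_path: str              # assumed to be a log file

    def prepare(self):
        """Create output locations and return a list of problems found."""
        problems = []
        for out_dir in (self.new_model_path, self.pretrained_model_name):
            Path(out_dir).mkdir(parents=True, exist_ok=True)
        # The log file itself is created by the script; only its folder is needed.
        Path(self.file_path).parent.mkdir(parents=True, exist_ok=True)
        if not Path(self.data_name).exists():
            problems.append(f"dataset not found: {self.data_name}")
        return problems
```

Calling `prepare()` once at start-up and stopping if it returns a non-empty list avoids discovering a bad dataset path only after a model has been loaded.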

`tinyllama_main.py`:

base_model_name: Identifier for the TinyLLaMA model.
new_model_path: Where unlearned model checkpoints are stored.
new_model_retrained: Where retrained TinyLLaMA models are saved.
file_path: Where evaluation outputs are logged.

`main.py`:

model_path: For saving tokenizer and model configurations.
output_path: Where distilled and pre-trained models are saved.
data_name: Name or path of the dataset file.

`learn_2024.py`:

dataset_name: Name or path of the dataset file.

`Evaluate.py`:

Set the `OPENAI_API_KEY` environment variable so the script can access OpenAI services.
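
A missing key is easier to diagnose if it is checked before any evaluation work begins. The snippet below is a minimal sketch of such a check; the helper `require_env` is hypothetical and is not claimed to be how `Evaluate.py` reads the key.

```python
import os


def require_env(name, env=None):
    """Return the value of an environment variable or fail with a clear message."""
    env = os.environ if env is None else env
    value = env.get(name, "").strip()
    if not value:
        raise RuntimeError(f"{name} is not set; export it before running Evaluate.py")
    return value
```

For example, `require_env("OPENAI_API_KEY")` near the top of a script stops the run immediately when the variable is absent or empty.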

Running the Scripts

Use the provided shell script `main_run.sh` to run all models and scripts simultaneously. Check that the script points to the correct paths for the Python files, then make it executable and run it:

chmod +x main_run.sh
./main_run.sh

The script launches each Python script in parallel and redirects their output to designated log files, so every run can be reviewed afterwards.
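
The contents of `main_run.sh` are not reproduced here. As an illustration of the same pattern (start everything at once, one log file per process, then wait), the sketch below does the equivalent in Python. The function `run_parallel` and the `<name>.log` naming are assumptions, not the repository's actual behavior.

```python
import subprocess
import sys
from pathlib import Path


def run_parallel(commands, log_dir):
    """Start every command at once, one log file each; return exit codes by name.

    commands maps a short name to an argument list, e.g.
    {"llama_main": [sys.executable, "llama_main.py"]}.
    """
    log_dir = Path(log_dir)
    log_dir.mkdir(parents=True, exist_ok=True)

    running = {}
    for name, argv in commands.items():
        log = open(log_dir / f"{name}.log", "w")
        # stderr is merged into stdout so each process has a single log file.
        proc = subprocess.Popen(argv, stdout=log, stderr=subprocess.STDOUT)
        running[name] = (proc, log)

    exit_codes = {}
    for name, (proc, log) in running.items():
        exit_codes[name] = proc.wait()  # block until this process finishes
        log.close()
    return exit_codes
```

A non-zero value in the returned dictionary identifies which script failed, and the matching log file holds its output.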

Usage

Once configuration is complete, execute the models using the shell script:

./main_run.sh

This command initiates parallel processing of the models and logs their output for review.

Contributing

You can contribute to this project in several ways:

Reporting Bugs: Submit detailed reports of any issues encountered.
Suggesting Enhancements: Propose ideas for improvements or new features.
Making Pull Requests: Follow the guidelines to create and submit pull requests effectively.

Please refer to CONTRIBUTING.md for detailed guidelines on contributing to the project.

License

This project is licensed under the Apache License, Version 2.0 (January 2004). The full license text is available in the LICENSE file.

Nawrin2k16/H-LLM
