GitHub - codesphere-community/llama2chat

llama.cpp

This repository is a fork of llama.cpp customized to facilitate Llama2 inference within Codesphere.

Overview

Llama.cpp is a powerful tool for running Llama2 inference, and this fork is tailored specifically for seamless integration with Codesphere environments.

Features

Pre-Configured CI Pipeline: The CI pipeline is set up to automatically fetch a pre-converted and quantized llama code instruct model from TheBloke on Hugging Face.
HTTP Server Example: The repository includes an HTTP server example, allowing for easy deployment and testing. Configuration options can be found in the /examples/server directory.

Usage

clone this repository in a new workspace (at least Pro/GPU)
start the Prepare stage in the CI-Pipeline
after the Prepare stage is done you can start the run stage
click on Open deployment in the top right corner

Documentation

For detailed configuration options and usage instructions, refer to the README file located in the /examples/server directory.

Note

Please note that while this repository provides a convenient setup for running Llama2 inference in Codesphere, further customization may be required to suit specific use cases or preferences.

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
.devops		.devops
ci		ci
common		common
docs		docs
examples		examples
gguf-py		gguf-py
grammars		grammars
media		media
models		models
pocs		pocs
prompts		prompts
scripts		scripts
spm-headers		spm-headers
tests		tests
.clang-tidy		.clang-tidy
.dockerignore		.dockerignore
.ecrc		.ecrc
.editorconfig		.editorconfig
.flake8		.flake8
.pre-commit-config.yaml		.pre-commit-config.yaml
CMakeLists.txt		CMakeLists.txt
LICENSE		LICENSE
Makefile		Makefile
Package.swift		Package.swift
README.md		README.md
SHA256SUMS		SHA256SUMS
build.zig		build.zig
ci.yml		ci.yml
codecov.yml		codecov.yml
convert-falcon-hf-to-gguf.py		convert-falcon-hf-to-gguf.py
convert-gptneox-hf-to-gguf.py		convert-gptneox-hf-to-gguf.py
convert-llama-ggml-to-gguf.py		convert-llama-ggml-to-gguf.py
convert-lora-to-ggml.py		convert-lora-to-ggml.py
convert.py		convert.py
flake.lock		flake.lock
flake.nix		flake.nix
ggml-alloc.c		ggml-alloc.c
ggml-alloc.h		ggml-alloc.h
ggml-cuda.cu		ggml-cuda.cu
ggml-cuda.h		ggml-cuda.h
ggml-metal.h		ggml-metal.h
ggml-metal.m		ggml-metal.m
ggml-metal.metal		ggml-metal.metal
ggml-mpi.c		ggml-mpi.c
ggml-mpi.h		ggml-mpi.h
ggml-opencl.cpp		ggml-opencl.cpp
ggml-opencl.h		ggml-opencl.h
ggml.c		ggml.c
ggml.h		ggml.h
k_quants.c		k_quants.c
k_quants.h		k_quants.h
llama.cpp		llama.cpp
llama.h		llama.h
llama2chat.webp		llama2chat.webp
metadata.json		metadata.json
mypy.ini		mypy.ini
requirements.txt		requirements.txt
run_with_preset.py		run_with_preset.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

llama.cpp

Overview

Features

Usage

Documentation

Note

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

llama.cpp

Overview

Features

Usage

Documentation

Note

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages