Model Gallery

2 models from 1 repository

archangel_sft_pythia2-8b

Datasets:
- stanfordnlp/SHP
- Anthropic/hh-rlhf
- OpenAssistant/oasst1

This repo contains the model checkpoints for:
- model family pythia2-8b
- optimized with the loss SFT
- aligned using the SHP, Anthropic HH and Open Assistant datasets.

Please refer to our [code repository](https://github.com/ContextualAI/HALOs) or [blog](https://contextual.ai/better-cheaper-faster-llm-alignment-with-kto/), which contain instructions for training your own HALOs and links to our model cards.

Repository: localai
License: apache-2.0

openvino-starling-lm-7b-beta-openvino-int8

Starling-LM-7B-beta is a Mistral-7B-based chat model fine-tuned with RLHF and RLAIF for improved instruction following. This OpenVINO IR version uses int8 quantization for optimized local inference and the OpenChat chat template for consistent conversational output.

Repository: localai
License: apache-2.0