Block Transformer: Global-to-Local Language Modeling for Fast Inference (NeurIPS 2024)
An architectural persistence experiment for large language models. Claude’s Home gives an AI time, memory, and place by combining scheduled execution with a durable filesystem, allowing one continuous instance to reflect, create, and evolve across sessions.
Production-grade architecture patterns, decision frameworks, and best practices for building reliable AI agents. Framework-agnostic reference for engineers.
Visualizations of key concepts in LLM architectures.
Jetta-Reinforcement-Learning-Hybrid-LLM-Architecture
The Compositional Agentic Architecture (CAA): A blueprint for building reliable, deterministic, and safe industrial AI agents.
Multi-agent, policy-driven AI system for processing sensitive enterprise documents with extraction, analysis, verification, deterministic orchestration, and full audit logging. Designed for regulated environments (banking, finance, insurance).
A collection of Small Language Models (SLMs) built from scratch in PyTorch.
The first end-to-end programming language and compiler fully developed by AI.
Production-oriented Telegram → n8n → FastAPI intake CRM with a deterministic state machine and an audit log.
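A deterministic state machine of the kind this repo describes reduces to a transition table: each (state, event) pair maps to exactly one next state, and every step is appended to the audit log. A minimal Python sketch, with states and events invented for illustration (none are taken from the repo):

```python
from enum import Enum, auto

class State(Enum):
    NEW = auto()
    QUALIFYING = auto()
    SCHEDULED = auto()
    CLOSED = auto()

# Exactly one legal next state per (state, event):
# the same input sequence always produces the same path.
TRANSITIONS = {
    (State.NEW, "contact_received"): State.QUALIFYING,
    (State.QUALIFYING, "qualified"): State.SCHEDULED,
    (State.QUALIFYING, "rejected"): State.CLOSED,
    (State.SCHEDULED, "completed"): State.CLOSED,
}

def step(state: State, event: str, audit: list) -> State:
    nxt = TRANSITIONS.get((state, event))
    if nxt is None:
        raise ValueError(f"illegal event {event!r} in state {state.name}")
    audit.append((state.name, event, nxt.name))  # append-only audit trail
    return nxt

audit: list = []
s = step(State.NEW, "contact_received", audit)
s = step(s, "qualified", audit)
print(s, audit)
```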
A Logical Virtual Memory (LVM)-based Instruction Set Architecture (ISA) for LLM context management. Models the LLM as a logic processor, using recursive logic trees and hierarchical addressing to counter attention dilution and intelligence collapse in long-horizon tasks.
Technical architecture and engineering lessons from building MyMate — a persistent-memory AI desktop application for long-session performance.
HSPMN: Hybrid Sparse-Predictive Matter Network, an LLM architecture optimized for Blackwell GPUs that bridges O(N) and O(N^2) routing via ALF-LB.
Hackable PyTorch template for decoder-only transformer architecture experiments. Llama baseline with RoPE, SwiGLU, and RMSNorm. Swap components, train, compare.
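Two of the named Llama-style components are compact enough to sketch. A minimal PyTorch version of RMSNorm and SwiGLU, with module and dimension names chosen for illustration rather than taken from the template:

```python
import torch
import torch.nn as nn

class RMSNorm(nn.Module):
    """Root-mean-square norm: rescale by the RMS of the features, no mean-centering."""
    def __init__(self, dim: int, eps: float = 1e-6):
        super().__init__()
        self.eps = eps
        self.weight = nn.Parameter(torch.ones(dim))

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        rms = torch.rsqrt(x.pow(2).mean(-1, keepdim=True) + self.eps)
        return self.weight * (x * rms)

class SwiGLU(nn.Module):
    """Gated feed-forward block: silu(x W_gate) * (x W_up), projected back down."""
    def __init__(self, dim: int, hidden: int):
        super().__init__()
        self.gate = nn.Linear(dim, hidden, bias=False)
        self.up = nn.Linear(dim, hidden, bias=False)
        self.down = nn.Linear(hidden, dim, bias=False)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        return self.down(torch.nn.functional.silu(self.gate(x)) * self.up(x))
```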
Reference architecture for structured AI memory lifecycle management — from the OPHION Memory OS Protocol.
Codebase ideation (structured the Django way, for easier understanding) for an LLM built without pre-trained models: custom embeddings (TF-IDF or Word2Vec) with FAISS for vector storage.
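A sketch of that pipeline under its stated assumptions (TF-IDF vectors standing in for embeddings, FAISS for nearest-neighbor search); the corpus, query, and variable names are invented for illustration:

```python
import faiss
import numpy as np
from sklearn.feature_extraction.text import TfidfVectorizer

docs = ["the cat sat on the mat", "dogs chase cats", "language models embed text"]

# Dense TF-IDF vectors as stand-in embeddings (no pre-trained model involved).
vectorizer = TfidfVectorizer()
vectors = vectorizer.fit_transform(docs).toarray().astype("float32")

# Exact inner-product index; L2-normalizing first makes scores cosine similarities.
faiss.normalize_L2(vectors)
index = faiss.IndexFlatIP(vectors.shape[1])
index.add(vectors)

query = vectorizer.transform(["a cat on a mat"]).toarray().astype("float32")
faiss.normalize_L2(query)
scores, ids = index.search(query, 2)
print(ids[0], scores[0])  # nearest documents and their cosine scores
```

Normalizing before indexing is the usual choice for TF-IDF retrieval, since raw inner products would otherwise favor longer documents.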
Living comparison table of LLM architectural choices (norm, attention, MoE, positional encoding, and more) from the Original Transformer (2017) to frontier models (2026). Based on Harm de Vries's figure, Sebastian Raschka's Big LLM Architecture Comparison, and Tatsunori Hashimoto's Stanford CS 336 lecture.
Architectural canon for production-grade RAFT / RAG systems: evaluation, safety, abstention, failure modes
A Modular Knowledge Transfer System for Large Language Models
Internal cognitive architecture of the AI persona “Chapiko.”