What this is
A selective reference
A selective collection of open-ish AI projects, tools, and resources organized for easier browsing.
AI Resources
A selective list of notable AI models, tools, datasets, and experiments across the open-ish landscape.
It is not meant to be exhaustive. Licensing and usage terms can differ widely, so treat this as an editorial starting point rather than a blanket endorsement. Please review each project independently. Lifehubber is not responsible for any loss, harm, or issues arising from use.
What this is
A selective collection of open-ish AI projects, tools, and resources organized for easier browsing.
How to use it
Each entry includes a short description, source label, and direct link to make scanning simpler.
Editorial approach
The aim is to keep the signal cleaner, with fewer items, clearer categories, and room for future updates over time.
AI Models
google/gemma-4
A family of multimodal open models from Google DeepMind that handle text and image input and generate text output.
LiquidAI/LFM2.5-350M
A hybrid model in the LFM2.5 family built for on-device deployment, with extended pre-training and reinforcement learning.
sparkyniner/Netryx-OpenSource-Next-Gen-Street-Level-Geolocation
A locally hosted geolocation tool for estimating precise coordinates from street-level images.
louis-e/arnis
Generates real-world locations inside Minecraft with a surprisingly high level of detail.
Speech Models
CohereLabs/cohere-transcribe-03-2026
An open source 2B parameter automatic speech recognition model for audio-in, text-out transcription across 14 languages.
HumeAI/tada
A speech-language model that aligns speech and text into a single synchronized stream.
fishaudio/s2-pro
A text-to-speech model with detailed control over prosody and emotional delivery.
KittenML/KittenTTS
A very small text-to-speech model designed to stay lightweight without feeling toy-like.
AI Agents
onyx-dot-app/onyx
An application layer for LLMs with a self-hostable interface and capabilities like RAG, web search, code execution, file creation, and deep research.
Intelligent-Internet/ii-agent
An open-source AI agent for practical work, built to be run, forked, and extended across solo, team, and internal-tooling use cases.
paperclipai/paperclip
A Node.js server and React UI for orchestrating teams of AI agents, assigning goals, and tracking work and costs from one dashboard.
HKUDS/CatchMe
A lightweight, vectorless system for capturing a broader digital footprint as usable context.
openagents-org/openagents
An open collaboration project centered on AI agent networks designed to work together more openly.
THU-MAIC/OpenMAIC
An open multi-agent interactive classroom designed to offer an immersive learning experience with one-click setup.
vectorize-io/hindsight
An agent memory system designed to help agents learn over time rather than only recall conversation history.
Panniantong/Agent-Reach
A CLI that gives AI agents broader web reach across platforms like Twitter, Reddit, YouTube, GitHub, Bilibili, and XiaoHongShu without paid API usage.
MiniMax-AI/skills
A development skills library for AI coding agents, with structured guidance across frontend, fullstack, Android, iOS, and shader work.
agentscope-ai/agentscope
A production-ready agent framework with core abstractions, visibility tooling, and built-in support for fine-tuning workflows.
HKUDS/OpenSpace
A framework focused on building agents that are smarter, lower-cost, and able to improve through self-evolving workflows.
open-gitagent/gitagent
A framework-agnostic, git-native standard for defining and sharing AI agents.
allenai/molmoweb
An open multimodal web agent from Ai2 that can navigate browser tasks from natural-language instructions.
Embodied / Physical AI
unitreerobotics/unifolm-wbt-dataset
A real-world humanoid robot whole-body teleoperation dataset for open environments.
norma-core/hardware/elrobot
A low-cost 3D-printed robotic arm intended for physical AI research and imitation learning.
wu-yc/LabClaw
A large package of workflow skills for biomedical and scientific AI work across multiple lab-heavy domains.
dimensionalOS/dimos
An operating system layer for controlling robots and other hardware platforms with natural-language workflows.
Productivity
yazinsai/OpenOats
A meeting note-taking assistant designed to be more conversational and responsive than passive transcription.
Ecosystem
yusufkaraaslan/Skill_Seekers
A preprocessing layer for turning raw documentation into reusable inputs for skills, RAG pipelines, and AI coding tools.
openai/plugins
A curated collection of Codex plugin examples for extending workflows with practical plugin patterns.
googleworkspace/cli
A single command-line interface for Drive, Gmail, Calendar, Docs, Sheets, Chat, Admin, and related workflows.
lightpanda-io/browser
A headless browser designed with AI automation use cases in mind.
vllm-project/vllm-omni
A framework for serving and running omni-modality models more efficiently.
K-Dense-AI/k-dense-byok
A desktop co-scientist setup built around scientific skills and bring-your-own-key workflows.
Vaibhavs10/insanely-fast-whisper
An opinionated CLI for very fast on-device transcription with Whisper.
Datasets
allenai/olmOCR-bench
A benchmark for evaluating how well OCR systems convert PDFs into useful markdown while preserving structure.
google/WaxalNLP
A large multilingual speech corpus for African languages introduced through the WAXAL paper.
Also in AI
AI Resources is built for browsing. AI Ballot offers a live ranking shaped by reader votes.