Skip to content
View shibing624's full-sized avatar
🐬
focus
🐬
focus

Organizations

@NLPchina

Block or report shibing624

Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
shibing624/README.md

Pinned Loading

  1. pycorrector pycorrector Public

    pycorrector is a toolkit for text error correction. 文本纠错,实现了Kenlm,T5,MacBERT,ChatGLM3,Qwen2.5等模型应用在纠错场景,开箱即用。

    Python 6.4k 1.2k

  2. text2vec text2vec Public

    text2vec, text to vector. 文本向量表征工具,把文本转化为向量矩阵,实现了Word2Vec、RankBM25、Sentence-BERT、CoSENT等文本表征、文本相似度计算模型,开箱即用。

    Python 5k 426

  3. MedicalGPT MedicalGPT Public

    MedicalGPT: Training Your Own Medical GPT Model with ChatGPT Training Pipeline. 训练医疗大模型,实现了包括增量预训练(PT)、有监督微调(SFT)、RLHF、DPO、ORPO、GRPO。

    Python 5.2k 723

  4. agentica agentica Public

    Agentica: Lightweight async-first Python framework for AI agents. 轻量级异步优先的AI Agent框架,支持工具调用、RAG、多智能体和MCP。

    Python 266 35

  5. ChatPDF ChatPDF Public

    RAG for Local LLM, chat with PDF/doc/txt files, ChatPDF. 纯原生实现RAG功能,基于本地LLM、embedding模型、reranker模型实现,支持GraphRAG,无须安装任何第三方agent库。

    Python 844 144

  6. imgocr imgocr Public

    Python3 package for Chinese/English OCR,use paddleocr-v5 onnx model(~20MB), with ultra-fast inference speed. 基于ppocr-v5-onnx模型推理,中英文OCR开源SOTA,推理速度超快。

    Python 129 20