Skip to content
View JoyeBright's full-sized avatar
:octocat:
Going deeper on NLP !
:octocat:
Going deeper on NLP !

Block or report JoyeBright

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
Showing results

A pip-installable PyTorch implementation of TSMixer, providing an easy-to-use and efficient solution for time-series forecasting.

Python 135 16 Updated Dec 12, 2023

My HomeAssistant Configuration (Home Assistant Supervised, Debian 10)

JavaScript 301 12 Updated Nov 25, 2024

Google Drive CLI Client

Rust 1,676 110 Updated Aug 3, 2024

(NAACL 2024) Guiding Large Language Models to Post-Edit Machine Translation with Error Annotations

Python 9 4 Updated Jul 23, 2024

Model, Code & Data for the EMNLP'23 paper "Making Large Language Models Better Data Creators"

Python 124 15 Updated Oct 19, 2023

Quickly rewrite git repository history (filter-branch replacement)

Python 9,086 737 Updated Feb 19, 2025

Site infrastructure for gwern.net. Custom Hakyll website with unique link archiving, popup UX, transclusions/collapses, dark+reader mode, bidirectional backlinks, and typography (sidenotes, dropcap…

Haskell 646 62 Updated Feb 19, 2025

Interpretability for sequence generation models 🐛 🔍

Python 401 36 Updated Nov 10, 2024

A curated list of papers, theses, datasets, and tools related to the application of Machine Learning for Software Engineering

703 93 Updated Jul 9, 2024

Python API wrapper for Instructure's Canvas LMS. Easily manage courses, users, gradebooks, and more.

Python 577 179 Updated Dec 27, 2024

Minimal, clean code for the Byte Pair Encoding (BPE) algorithm commonly used in LLM tokenization.

Python 9,403 894 Updated Jul 1, 2024

Seed Machine Translation Data

30 2 Updated Nov 12, 2024

The FLORES+ Machine Translation Benchmark

100 15 Updated Nov 12, 2024

Anonymous Github is a proxy server to support anonymous browsing of Github repositories for open-science code and data.

TypeScript 1,585 59 Updated Jan 30, 2025

Tools for evaluating the performance of MT metrics on data from recent WMT metrics shared tasks.

Python 98 18 Updated Dec 16, 2024

A byte-level decoder architecture that matches the performance of tokenized Transformers.

Jupyter Notebook 65 7 Updated Apr 24, 2024

MAchine Translation Evaluation Online (MATEO)

Python 19 2 Updated Mar 15, 2024

GEMBA — GPT Estimation Metric Based Assessment

Python 108 20 Updated Jul 30, 2024

Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.

Python 154 22 Updated Jun 18, 2024

BLEURT is a metric for Natural Language Generation based on transfer learning.

Python 715 85 Updated Aug 4, 2023

Sync your Outlook and Google calendars

C# 1,868 213 Updated Feb 17, 2025

Track and predict the energy consumption and carbon footprint of training deep learning models.

Python 420 29 Updated Feb 6, 2025

Ensembling Hugging Face transformers made easy

Python 63 5 Updated Dec 24, 2022

An Emacs framework for the stubborn martian hacker

Emacs Lisp 19,957 3,090 Updated Jan 14, 2025

Critical difference diagram with Wilcoxon-Holm post-hoc analysis.

Python 270 76 Updated Aug 24, 2022

AutoPrompt: Automatic Prompt Construction for Masked Language Models.

Python 611 83 Updated Aug 24, 2024

Large Language Model Text Generation Inference

Python 9,771 1,145 Updated Feb 19, 2025
Next
Showing results