Blog posts

Blogs in English:

  1. Reading list: synthetic data of LLMs, by LLMs, and for LLMs.
  2. Repoformer: Selective Retrieval for Repository-Level Code Completion.
  3. Reading list: keyphrase extraction and keyphrase generation.

Blogs in Chinese:

  1. When to retrieve?
  2. Faithfulness evaluation for text summarization.
  3. Sampling strategies for NLG and their implementations.
  4. Adapters.
  5. Sequence labeling with HMM, MEMM, and CRF.

Some blogs I found interesting:

  1. An intro to einsum.
  2. The Essence of Global Convolution Models.
  3. Transformer Inference Arithmetic.