
Forwarded for publication. Fwd: Top Important LLM Papers for the Week from 01/01 to 07/01

Forwarded for publication on www.marielandryceo.com 

Marie Seshat Landry
CEO / Spymaster
Marie Landry's Spy Shop
www.marielandryceo.com


---------- Forwarded message ---------
From: Youssef Hosni from To Data & Beyond <youssefh@substack.com>
Date: Wed, Jan 10, 2024 at 12:27 AM
Subject: Top Important LLM Papers for the Week from 01/01 to 07/01
To: <marielandryx@gmail.com>



Top Important LLM Papers for the Week from 01/01 to 07/01

Stay Updated with Recent Large Language Models Research

Jan 10
 

Every week, top-tier academic conferences and journals showcase innovative research on large language models (LLMs), with advances in subfields such as benchmarking, fine-tuning, reasoning, training and evaluation, and transformer and attention-based architectures.

This article lists the most significant LLM papers published in the first week of January 2024, grouped by topic. Whether you're a researcher, practitioner, or enthusiast, it offers a quick view of the state-of-the-art techniques and tools in the field.

Table of Contents:

  1. LLM Progress & Benchmarking

  2. LLM Fine Tuning

  3. LLM Reasoning

  4. LLM Training & Evaluation

  5. Transformers & Attention Based Models



1. LLM Progress & Benchmarking

  1. Boosting Large Language Model for Speech Synthesis: An Empirical Study

  2. LARP: Language-Agent Role Play for Open-World Games

  3. Improving Text Embeddings with Large Language Models

  4. PanGu-$π$: Enhancing Language Model Architectures via Nonlinearity Compensation

  5. COSMO: COntrastive Streamlined MultimOdal Model with Interleaved Pre-Training

  6. TinyLlama: An Open-Source Small Language Model

  7. GeoGalactica: A Scientific Large Language Model in Geoscience

  8. Unicron: Economizing Self-Healing LLM Training at Scale

  9. LLaMA Beyond English: An Empirical Study on Language Capability Transfer

  10. DocLLM: A layout-aware generative language model for multimodal document understanding

  11. LLaMA Pro: Progressive LLaMA with Block Expansion

  12. LLM Augmented LLMs: Expanding Capabilities through Composition

  13. LLaVA-$φ$: Efficient Multi-Modal Assistant with Small Language Model

  14. A Comprehensive Study of Knowledge Editing for Large Language Models

2. LLM Fine Tuning

  1. Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models

  2. Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models

3. LLM Reasoning

  1. Towards Truly Zero-shot Compositional Visual Reasoning with LLMs as Programmers

  2. Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models

4. LLM Training & Evaluation

  1. Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws

  2. Understanding LLMs: A Comprehensive Overview from Training to Inference

5. Transformers & Attention Based Models

  1. Boundary Attention: Learning to Find Faint Boundaries at Any Resolution

  2. ICE-GRT: Instruction Context Enhancement by Generative Reinforcement-based Transformers


