---------- Forwarded message ---------
From: Youssef Hosni from To Data & Beyond <youssefh@substack.com>
Date: Wed, Jan 10, 2024 at 12:27 AM
Subject: Top Important LLM Papers for the Week from 01/01 to 07/01
To: <marielandryx@gmail.com>
Stay Updated with Recent Large Language Models Research
Every week, top-tier academic conferences and journals showcase innovative research on large language models (LLMs), with breakthroughs in subfields such as benchmarking, fine-tuning, reasoning, training and evaluation, and attention-based models. This article provides a comprehensive overview of the most significant papers published in the first week of January 2024, highlighting the latest research and advancements in LLMs. Whether you're a researcher, practitioner, or enthusiast, this article will provide valuable insights into the state-of-the-art techniques and tools in LLMs.

Table of Contents:
1. LLM Progress & Benchmarking
2. LLM Fine Tuning
3. LLM Reasoning
4. LLM Training & Evaluation
5. Transformers & Attention Based Models

1. LLM Progress & Benchmarking
Boosting Large Language Model for Speech Synthesis: An Empirical Study
LARP: Language-Agent Role Play for Open-World Games
Improving Text Embeddings with Large Language Models
PanGu-$π$: Enhancing Language Model Architectures via Nonlinearity Compensation
COSMO: COntrastive Streamlined MultimOdal Model with Interleaved Pre-Training
TinyLlama: An Open-Source Small Language Model
GeoGalactica: A Scientific Large Language Model in Geoscience
Unicron: Economizing Self-Healing LLM Training at Scale
LLaMA Beyond English: An Empirical Study on Language Capability Transfer
DocLLM: A layout-aware generative language model for multimodal document understanding
LLaMA Pro: Progressive LLaMA with Block Expansion
LLM Augmented LLMs: Expanding Capabilities through Composition
LLaVA-$φ$: Efficient Multi-Modal Assistant with Small Language Model
A Comprehensive Study of Knowledge Editing for Large Language Models

2. LLM Fine Tuning
Self-Play Fine-Tuning Converts Weak Language Models to Strong Language Models
Astraios: Parameter-Efficient Instruction Tuning Code Large Language Models

3. LLM Reasoning
Towards Truly Zero-shot Compositional Visual Reasoning with LLMs as Programmers
Gemini in Reasoning: Unveiling Commonsense in Multimodal Large Language Models

4. LLM Training & Evaluation
Beyond Chinchilla-Optimal: Accounting for Inference in Language Model Scaling Laws
Understanding LLMs: A Comprehensive Overview from Training to Inference

5. Transformers & Attention Based Models
Boundary Attention: Learning to Find Faint Boundaries at Any Resolution
ICE-GRT: Instruction Context Enhancement by Generative Reinforcement-based Transformers