
Capability alignment, long text, Claude 3: a look at the key technical paths of large models

WBOY
Release: 2024-08-05 14:01:32

Large text models have reached a new level. Claude 3 surpasses GPT-4 and Gemini 1.0 Ultra, which was launched less than a month ago, in multiple dimensions such as mathematics, programming, multilingual understanding, and vision. "Rapidly changing" is no longer enough to describe the current pace of large model technology.

To better share the latest progress in large model technology, in 2024 this site, Zhangjiang Science and Technology Investment, Zhangjiang Incubator, and WAIC Circle jointly launched the "Large Model Technology Workshop" series, inviting frontline experts from industry, academia, and research to share cutting-edge observations and insights.

On the afternoon of March 22, on the 3rd floor of Building A, Kehai Building, No. 800 Naxian Road, Zhangjiang, Shanghai, scholars and technical experts from Fudan University, Waveform Intelligence, and Amazon Cloud Technology will give in-depth talks and hold discussions under the theme "The Claude 3 heat wave is coming: let's talk about the key technical paths of large text models". Professional audiences who follow the progress of large models are welcome to attend and join the discussion.


Guest introduction


Speech title: Large model capability alignment

Speaker:

  • Gui Yu

Associate researcher at Fudan University Natural Language Processing Laboratory

Research field:

  • Pre-trained model
  • Human-like alignment
  • Agent interaction

Academic achievements:

  • Published more than 50 papers in high-level international academic journals and conferences
  • Has led multiple talent projects (National Natural Science Foundation of China, Computer Society, Artificial Intelligence Society)
  • Awards won:

    • First Prize, Qian Weichang Chinese Information Processing Science and Technology Award
    • NeurIPS 2023 Large Model Alignment Track Best Paper Award
    • COLING 2018 Best Paper Nomination Award
    • NLPCC 2019 Outstanding Paper Award
    • CIPS Excellent Paper Award
    • ACM Excellent Paper Award
  • Selected for:

    • China Association for Science and Technology Youth Talent Promotion Project
    • Shanghai Morning Star Program
    • World Artificial Intelligence Conference Yunfan Award "Bright Star"


Speech title: Training and inference solutions for large models for ultra-long creative text writing

Speaker:

Zhou Wangchunshu, CTO of Waveform Intelligence.

  • Earned bachelor's and master's degrees from the Sino-French Engineering College of Beihang University
  • Pursued a Ph.D. at ETH Zurich under Ryan Cotterell and Mrinmaya Sachan
  • Left the program in April 2023 to found AIWaves, where he serves as co-founder and CTO
  • The research directions mainly include:

    • LLM training & prompting
    • language agents
    • long/creative text generation
    • efficient methods for NLP
    • multi-modal LLMs
    • commonsense reasoning etc.
  • Received Baidu Scholarship in 2022
  • Interned at MSRA, ByteDance AI Lab, AI2, and other institutions, and served as a research scientist at ByteDance AI Lab
  • Has published more than 30 papers at machine learning and natural language processing conferences such as NeurIPS/ICML/ICLR/ACL/EMNLP/NAACL, serves as a reviewer for these conferences, and serves as Action Editor/Area Chair for ARR/*ACL


Speech title: Claude 3 technical analysis and scenario demonstration

Speaker:

Lin Ye, senior solutions architect at Amazon Cloud Technology. He is proficient in C++, C#, Java, PHP, Python, JS, and other development languages, and has grown a GitHub repo from single digits to 3000. He has built a bike-sharing app that supports 10 million users, participated in the development of apps for several well-known car companies, and won the Zhejiang ACM Award in 2005. He now focuses on enterprise cloud-native architecture and GenAI development and is committed to applying these capabilities to enterprise business scenarios.

Event Registration

Registration for the "Large Model Technology Workshop Phase 1" is now open. Scan the QR code below or click "Read Original" at the bottom to go directly to the event registration page.


For questions related to this event, you are welcome to add our assistant (ID: 13661489516) or consult via email (chenyinyi@jiqizhixin.com).

source:jiqizhixin.com