
  • ICLR 2024 Spotlight | NoiseDiffusion: Correct diffusion model noise and improve interpolation image quality
    Author: Pengfei Zheng. Affiliation: USTC, HKBU TMLR Group. In recent years, the rapid development of generative AI has driven high-profile fields such as text-to-image and video generation. At the core of these techniques is the diffusion model. A diffusion model first defines a forward process that keeps adding noise until the image becomes Gaussian noise, then runs a reverse process that gradually denoises the Gaussian noise into a clear image, yielding realistic samples. Interpolating between generated images with the diffusion ordinary differential equation has great potential for video generation and advertising creatives. However, we noticed that when this method is applied to natural images, the interpolated images are often unsatisfactory.
    AI 1056 2024-05-06 14:01:24
  • AI learns to hide its thinking and reason in secret! Solving complex tasks without relying on human experience makes it even more of a black box
    When AI solves math problems, is the real thinking actually hidden "mental arithmetic"? A new study by a team from New York University found that even when the AI is not allowed to write out its steps and they are replaced with meaningless "..." tokens, its performance on some complex tasks can still improve substantially. One of the authors, Jacob Pfau, said that as long as you spend compute to generate additional tokens, you gain an advantage; it doesn't matter which tokens you choose. For example, ask a 34M-parameter Llama a simple question: how many of the first 6 digits of the natural constant e are greater than 5? Answering directly, the AI is essentially guessing: only 6 digits are being counted, yet it answers 7. Let the AI write out the steps and check each digit, and it gets the correct answer. Let AI hide the steps and replace them with a large number of "...
    AI 900 2024-05-06 12:00:30
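The example question in this teaser can be checked by hand. Below is a minimal sketch of the digit-by-digit verification; the function name and the two readings of "first 6 digits" (with or without the leading 2) are assumptions made for illustration, not details from the study.

```python
# Leading digits of e = 2.718281828..., written without the decimal point.
E_DIGITS = "2718281828"


def count_greater_than(digits: str, n: int, threshold: int) -> int:
    """Count how many of the first n digits are strictly greater than threshold."""
    return sum(1 for d in digits[:n] if int(d) > threshold)


# Reading 1 (assumption): the leading 2 counts as the first digit -> 2,7,1,8,2,8
including_leading = count_greater_than(E_DIGITS, 6, 5)

# Reading 2 (assumption): only digits after the decimal point -> 7,1,8,2,8,1
after_decimal = count_greater_than(E_DIGITS[1:], 6, 5)

print(including_leading, after_decimal)
```

Under either reading, the digits greater than 5 are 7, 8 and 8, so the count cannot exceed 6, which is why an answer of 7 is clearly wrong.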
  • Stanford's Li Feifei starts her first business: two years of academic leave, aiming at 'spatial intelligence'
    Unexpectedly, even in the era of large models, the well-known "AI Godmother" Li Feifei is starting a business and has completed a seed round of financing. According to an exclusive Reuters report, the renowned computer scientist is building a startup that uses human-like processing of visual data to enable artificial intelligence to perform advanced reasoning. People familiar with the matter said that Li Feifei recently raised a seed round for the company, with investors including Silicon Valley venture capital firm Andreessen Horowitz and the Canadian firm Radical Ventures, which she joined last year. However, spokespersons for both Andreessen Horowitz and Radical Ventures declined to comment.
    AI 1088 2024-05-05 13:04:06
  • Walking the 'dog' on the yoga ball! Eureka, selected as one of NVIDIA's top ten projects, has made a new breakthrough
    The robot dog walks steadily on a yoga ball, and its balance is quite good. It can handle various scenes, whether a flat sidewalk or a challenging lawn, and it stays upright even when kicked by researchers. Even with a yoga ball under its feet, the robot dog does not tip over, and it keeps its balance even when the ball is deflated. The demonstrations above are all shown at 1x speed and have not been accelerated. Paper address: https://eureka-research.github.io/dr-eureka/assets/dreureka-paper.pdf Project homepage: https://github.com/eureka-research/DrEureka Paper title: DrE
    AI 531 2024-05-05 13:01:01
  • Small models saturate and underperform. Is Softmax the root cause?
    Small language models emerged to offset the high training and inference costs of large language models. However, their performance can decline after a certain stage of training (the saturation phenomenon). What causes this phenomenon? Can it be overcome, or even exploited, to improve small language models? Recent progress in language modeling comes from pre-training highly parameterized neural networks on extremely large web text corpora. In practice, training and running inference with such models can be costly, which motivates the use of smaller alternatives. However, it has been observed that smaller models may suffer from saturation, a phenomenon characterized by a drop in capability followed by a plateau at some advanced stage of training. A recent paper found that this saturation phenomenon can be reduced by smaller models
    AI 1010 2024-05-04 13:10:01
  • Finally, someone investigated the overfitting of small models: two-thirds show data contamination, and Microsoft Phi-3 and Mixtral 8x22B were named
    Do two-thirds of today's most popular large models have an overfitting problem? A newly released study has surprised many researchers in the field. Improving the reasoning capabilities of large language models is one of the most important directions of current research. Many recently released small models, such as Microsoft's Phi-3 and Mixtral 8x22B, appear to handle this type of task well. The researchers pointed out a key problem in current large model research: many studies fail to accurately benchmark the capabilities of existing LLMs. This suggests that more time needs to be spent evaluating and testing the current capability level of LLMs. This is because most current research uses GSM8k, MATH, M
    AI 450 2024-05-04 13:05:13
  • A relay spanning more than 300 years: inspired by Terence Tao, mathematicians decide to use AI to formalize the proof of Fermat's Last Theorem
    Inspired by Terence Tao, more and more mathematicians have begun to use artificial intelligence for mathematical exploration. This time, their target is Fermat's Last Theorem, one of the world's top ten most difficult mathematical problems. Although the theorem was proved in the 1990s, the proof is extremely complex and has not yet been fully formalized. Mathematicians hope that the computing power and intelligent algorithms of artificial intelligence can help them formalize it. Fermat's Last Theorem (FLT) was proposed by the 17th-century French mathematician Pierre de Fermat. There is a legendary story behind it. It is said that around 1637, when Fermat was reading the Latin translation of Diophantus' Arithmetic, he wrote next to the 8th proposition of Book II
    AI 593 2024-05-03 13:04:01
  • Transformer wants to become Kansformer? It has taken decades for MLP to usher in challenger KAN
    The MLP (multilayer perceptron) has been used for decades. Is there really no other choice? Multilayer perceptrons (MLPs), also known as fully connected feedforward neural networks, are the fundamental building blocks of today's deep learning models. The importance of MLPs cannot be overstated, as they are the default method for approximating nonlinear functions in machine learning. However, is the MLP the best nonlinear regressor we can build? Although MLPs are widely used, they have significant drawbacks. For example, in Transformer models, MLPs consume almost all non-embedding parameters and, without post-hoc analysis tools, are generally less interpretable than attention layers. So, is there an alternative to the MLP?
    AI 827 2024-05-03 13:01:04
  • The hottest generative AI hardware has sold more than 100,000 units. After tearing it apart, it turns out it's just an Android app?
    "Rabbit R1 is essentially a launcher app on the Android system. Once cracked, it can run on a phone." By cracking it, Rahman managed to start and run the R1 application on a Pixel 6a phone. On Tuesday, American journalist Mishaal Rahman exposed the details of the well-known generative AI hardware Rabbit R1, which immediately attracted the attention of the technology community. A few months ago, two startups, Humane and Rabbit, launched their artificial intelligence devices one after the other: the Ai Pin and the Rabbit R1. Initially, some believed these devices would usher in a new era of wearable artificial intelligence. However, as the months passed, controversy grew over the two devices. R
    AI 728 2024-05-02 16:01:19
  • Yu Chengdong steps down as CEO of Huawei Terminal BG, He Gang will take over
    According to multiple media reports, Huawei internally issued a personnel adjustment document on the afternoon of April 30, announcing that Yu Chengdong would step down as CEO of Huawei Terminal BG. Yu Chengdong will remain chairman of Terminal BG. He Gang, the former Chief Operating Officer of Huawei Terminal BG, will take over as CEO of Huawei Terminal BG. According to reports, the document contains no information beyond these personnel changes. There is no further explanation of the background of this major personnel change or of Yu Chengdong's new business focus after stepping down as CEO of Terminal BG. Some sources said that this is a routine business structure adjustment that will give Yu Chengdong more energy to create high-quality products for consumers. Yu Chengdong was born in 1969. He graduated from the Automatic Control Department of Northwestern Polytechnical University with a bachelor's degree and holds a master's degree from Tsinghua University.
    AI 524 2024-05-02 16:01:14
  • How can OctopusV3, with less than 1 billion parameters, compare with GPT-4V and GPT-4?
    Multi-modal AI systems can process and learn from various types of data, including natural language, vision and audio, to guide their behavioral decisions. Recently, research on incorporating visual data into large language models (such as GPT-4V) has made important progress, but effectively converting image information into executable actions for AI systems remains a challenge. A common approach is to convert image data into corresponding text descriptions, on which the AI system then acts. This can be done with supervised learning on existing image data sets, allowing the AI system to learn the image-to-text mapping automatically. In addition, reinforcement learning methods can be used to learn how to make decisions based on image information by interacting with the environment.
    AI 596 2024-05-02 16:01:01
  • Mamba beats Transformer for the first time across 12 video understanding tasks
    The AIxiv column of this site publishes academic and technical content. In recent years, it has published more than 2,000 reports covering top laboratories at major universities and companies around the world, effectively promoting academic exchange and dissemination. If you have excellent work that you want to share, please feel free to contribute or contact us for coverage. Submission email: liyazhou@jiqizhixin.com; zhaoyunfeng@jiqizhixin.com. Exploring a new realm of video understanding, the Mamba model leads a new trend in computer vision research! The limitations of traditional architectures have been broken. The state space model Mamba has brought revolutionary changes to the field of video understanding with its unique advantages in long sequence processing. From Nanjing University, Shanghai
    AI 1409 2024-05-01 08:20:15
  • Understanding GraphRAG (1): Challenges of RAG
    RAG (Retrieval-Augmented Generation) is a method that enhances existing large language models (LLMs) with external knowledge sources to provide more contextually relevant answers. In RAG, the retrieval component obtains additional information from a specific source and feeds it into the LLM prompt, so that the LLM's response is grounded in this information (the augmentation stage). RAG is more economical than other techniques such as fine-tuning. It also has the advantage of reducing hallucinations by providing additional context. As a result, RAG has become the go-to workflow for today's LLM tasks (such as recommendation, text extraction and sentiment analysis). If we break this idea down further, based on user intent, we typically look at
    AI 1028 2024-04-30 19:10:01
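The retrieve-then-augment flow described in this teaser can be sketched in a few lines. This is only an illustrative sketch under stated assumptions: the tiny document list, the word-overlap scoring and the names `retrieve` and `build_prompt` are all hypothetical, and the final call to an LLM is left out because the source does not name one.

```python
from collections import Counter

# Hypothetical external knowledge source (assumption for illustration).
DOCUMENTS = [
    "The retrieval component fetches passages from an external knowledge source.",
    "Fine-tuning updates model weights and is usually more expensive.",
    "Mamba is a state space model for long sequences.",
]


def tokenize(text: str) -> list[str]:
    """Lowercase and strip basic punctuation from whitespace-separated words."""
    return [w.strip(".,?!").lower() for w in text.split()]


def score(query: str, doc: str) -> int:
    """Word-overlap score; a stand-in for a real embedding similarity."""
    q, d = Counter(tokenize(query)), Counter(tokenize(doc))
    return sum(min(q[w], d[w]) for w in q)


def retrieve(query: str, docs: list[str], k: int = 2) -> list[str]:
    """Retrieval stage: return the k highest-scoring documents."""
    return sorted(docs, key=lambda d: score(query, d), reverse=True)[:k]


def build_prompt(query: str, passages: list[str]) -> str:
    """Augmentation stage: place the retrieved passages into the LLM prompt."""
    context = "\n".join(f"- {p}" for p in passages)
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\nQuestion: {query}\nAnswer:"
    )


query = "What does the retrieval component fetch?"
top = retrieve(query, DOCUMENTS, k=1)
prompt = build_prompt(query, top)
print(prompt)
```

The resulting prompt would then be passed to whichever LLM is in use; a production system would typically replace the word-overlap score with vector similarity over embeddings.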
  • Xiaohongshu makes intelligent agents quarrel! Jointly launches an exclusive group chat tool for large models with Fudan University
    Language is not only a collection of words, but also a carnival of emoticons, a sea of memes, and a battlefield for keyboard warriors (eh? What's wrong?). How does language shape our social behavior? How does our social structure evolve through constant verbal communication? Recently, researchers from Fudan University and Xiaohongshu explored these questions in depth by introducing a simulation platform called AgentGroupChat. The platform is inspired by the group chat function of social media such as WhatsApp. On the AgentGroupChat platform, agents can simulate various chat scenarios in social groups to help researchers understand in depth how language affects human behavior.
    AI 1151 2024-04-30 18:40:23
  • GitHub version of Devin is online, you can develop applications if you can type, Microsoft CEO: Redefine IDE
    Microsoft's "GitHub version of Devin", Copilot Workspace, is finally online! Workspace is a new "Copilot-native" development environment that aims to let all developers use natural language to turn the ideas in their minds into applications. In other words, as long as you have an idea and can type, you can engage in software development. Its all-natural-language workflow has also earned Workspace the title of "GitHub version of Devin" from netizens. GitHub CEO Dohmke said that Workspace has surpassed Copilot's original functions and will redefine the developer experience. Microsoft CEO Nadella also mentioned again
    AI 543 2024-04-30 17:55:24
