{"id":814,"date":"2023-09-11T09:47:06","date_gmt":"2023-09-11T09:47:06","guid":{"rendered":"https:\/\/sciencesetrecherches.eu\/?p=814"},"modified":"2023-09-11T09:48:42","modified_gmt":"2023-09-11T09:48:42","slug":"falcon-180b-un-modele-dia-open-source-avec-180-milliards-de-parametres-entraines-sur-3-500-milliards-de-jetons","status":"publish","type":"post","link":"https:\/\/sciencesetrecherches.eu\/?p=814","title":{"rendered":"Falcon 180B: an open-source AI model with 180 billion parameters trained on 3.5 trillion tokens"},"content":{"rendered":"\n<div id=\"wp-block-themeisle-blocks-circle-counter-83e8cb28\" data-percentage=\"50\" data-duration=\"2\" data-height=\"100\" data-stroke-width=\"10\" class=\"wp-block-themeisle-blocks-circle-counter\"><div class=\"wp-block-themeisle-blocks-circle-counter-title__area\"><span class=\"wp-block-themeisle-blocks-circle-counter-title__value\">Skill<\/span><\/div><div class=\"wp-block-themeisle-blocks-circle-counter__bar\"><\/div><\/div>\n\n\n\n<figure class=\"wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio\"><div class=\"wp-block-embed__wrapper\">\n<iframe loading=\"lazy\" title=\"Introducing Falcon 180B: The World&#039;s Most Powerful Open LLM!\" width=\"1104\" height=\"621\" src=\"https:\/\/www.youtube.com\/embed\/9MArp9H2YCM?feature=oembed\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" allowfullscreen><\/iframe>\n<\/div><\/figure>\n\n\n\n<p class=\"wp-block-paragraph\">Falcon 180B surpasses LLaMA 2 and other models in both scale and performance across a range of natural language processing (NLP) tasks. 
Falcon 180B ranks first on the Hugging Face leaderboard for open-access models with a score of 68.74 and comes close to parity with commercial models such as Google&#8217;s PaLM-2 on evaluations such as the HellaSwag benchmark. More specifically, the team&#8217;s data show that Falcon 180B matches or exceeds PaLM-2 Medium on commonly used benchmarks, including HellaSwag, LAMBADA, WebQuestions, and Winogrande.<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">Falcon 180B is practically on par with Google&#8217;s PaLM-2 Large. That is an extremely strong result for an open-source model, even when compared with solutions developed by industry giants. Compared with ChatGPT, the model is reportedly more capable than the free version but less capable than the paid ChatGPT Plus service, launched at the start of the year. &#8220;Falcon 180B sits between GPT-3.5 and GPT-4 depending on the evaluation benchmark, and it will be very interesting to follow the improvements made by the community now that it is openly available,&#8221; explains the team at the Technology Innovation Institute (TII).<\/p>\n\n\n\n<p class=\"wp-block-paragraph\">For those looking for ready-to-use conversational capabilities, TII offers Falcon 180B-Chat, a derivative of Falcon 180B fine-tuned on a mix of chat datasets. The Chat variant has its own advantages, with an architecture optimized for inference. However, it is not ideal for those who want to fine-tune the model further for specific instruction or conversation tasks. 
Falcon 180B is now freely available on the Hugging Face Hub, and Abu Dhabi&#8217;s TII said on Wednesday that the new AI model can be used for both research and commercial purposes.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>Falcon 180B surpasses LLaMA 2 and other models in both scale and performance across a range of natural language processing (NLP) tasks. Falcon 180B ranks first on the Hugging Face leaderboard for open-access models with a score of 68.74 and comes close to parity with [&hellip;]<\/p>\n","protected":false},"author":1,"featured_media":817,"comment_status":"closed","ping_status":"closed","sticky":true,"template":"","format":"video","meta":{"_themeisle_gutenberg_block_has_review":false,"footnotes":""},"categories":[67,31,29,30],"tags":[69,70,57],"series":[],"class_list":["post-814","post","type-post","status-publish","format-video","has-post-thumbnail","hentry","category-falcon-180b","category-gpt","category-ia","category-intelligence-artificielle","tag-180b","tag-3500-jetons","tag-falcon","post_format-post-format-video"],"_links":{"self":[{"href":"https:\/\/sciencesetrecherches.eu\/index.php?rest_route=\/wp\/v2\/posts\/814","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/sciencesetrecherches.eu\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/sciencesetrecherches.eu\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/sciencesetrecherches.eu\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/sciencesetrecherches.eu\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=814"}],"version-history":[{"count":1,"href":"https:\/\/sciencesetrecherches.eu\/index.php?rest_route=\/wp\/v2\/posts\/814\/revisions"}],"predecessor-version":[{"id":815,"href":"https:\/\/sciencesetrecherches.eu\/index.php?rest_route=\/wp\/v2\/posts\/814\/revisions\/815"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/sciencesetrecherches.eu\/index.php?rest_route=\/wp\/v2\/media\/817"}],"wp:attachment":[{"href":"https:\/\/sciencesetrecherches.eu\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=814"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/sciencesetrecherches.eu\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=814"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/sciencesetrecherches.eu\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=814"},{"taxonomy":"series","embeddable":true,"href":"https:\/\/sciencesetrecherches.eu\/index.php?rest_route=%2Fwp%2Fv2%2Fseries&post=814"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}