{"id":11239,"date":"2024-06-11T07:54:16","date_gmt":"2024-06-11T14:54:16","guid":{"rendered":"https:\/\/mattfife.com\/?p=11239"},"modified":"2024-08-31T18:06:01","modified_gmt":"2024-09-01T01:06:01","slug":"getting-closer-to-synthetic-people","status":"publish","type":"post","link":"https:\/\/mattfife.com\/?p=11239","title":{"rendered":"Getting closer to synthetic people"},"content":{"rendered":"\n<p>Microsoft has <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/project\/vasa-1\/\" data-type=\"link\" data-id=\"https:\/\/www.microsoft.com\/en-us\/research\/project\/vasa-1\/\">released a fascinating new framework for generating lifelike talking faces<\/a> called VASA-1.<\/p>\n\n\n\n<p>Given a single static image and a speech audio clip, VASA-1 is capable of producing lip movements that are synchronized with the audio and capture a large spectrum of facial nuances and natural head motions.<\/p>\n\n\n\n<figure class=\"wp-block-embed is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio\"><div class=\"wp-block-embed__wrapper\">\n<span class=\"embed-youtube\" style=\"text-align:center; display: block;\"><iframe loading=\"lazy\" class=\"youtube-player\" width=\"640\" height=\"360\" src=\"https:\/\/www.youtube.com\/embed\/fTAuzFzMt5Y?version=3&#038;rel=1&#038;showsearch=0&#038;showinfo=1&#038;iv_load_policy=1&#038;fs=1&#038;hl=en-US&#038;autohide=2&#038;wmode=transparent\" allowfullscreen=\"true\" style=\"border:0;\" sandbox=\"allow-scripts allow-same-origin allow-popups allow-presentation allow-popups-to-escape-sandbox\"><\/iframe><\/span>\n<\/div><\/figure>\n\n\n\n<p><a href=\"https:\/\/www.microsoft.com\/en-us\/research\/project\/vasa-1\/\" data-type=\"link\" data-id=\"https:\/\/www.microsoft.com\/en-us\/research\/project\/vasa-1\/\">See more here,<\/a> read the paper <a href=\"https:\/\/arxiv.org\/pdf\/2404.10667.pdf\" data-type=\"link\" data-id=\"https:\/\/arxiv.org\/pdf\/2404.10667.pdf\">here <\/a>and <a 
href=\"https:\/\/arxiv.org\/abs\/2404.10667\" data-type=\"link\" data-id=\"https:\/\/arxiv.org\/abs\/2404.10667\">here<\/a>.<\/p>\n\n\n\n<p>Getting worried you&#8217;ll be replaced by AI yet? If this gets perfected (it&#8217;s not perfect yet, but the results get better and better each year), then you can pretty much get rid of any &#8216;talking head&#8217; jobs. <\/p>\n\n\n\n<p>This could also be used to fool people on conference calls, where poor video quality would make any minor glitches unnoticeable or easy to dismiss as streaming artifacts.<\/p>\n\n\n\n<p>Just <a href=\"https:\/\/www.apple.com\/leadership\/images\/bio\/tim-cook_image.png.og.png?1712850301825\" data-type=\"link\" data-id=\"https:\/\/www.apple.com\/leadership\/images\/bio\/tim-cook_image.png.og.png?1712850301825\">slap the CEO&#8217;s face<\/a> into this, set up a conference call with finance via some very easy phishing, and approve that $1m transfer to your Swiss bank account. <\/p>\n\n\n\n<p>Articles:<\/p>\n\n\n\n<ul class=\"wp-block-list\">\n<li><a href=\"https:\/\/www.engadget.com\/microsofts-ai-tool-can-turn-photos-into-realistic-videos-of-people-talking-and-singing-070052240.html?guccounter=1\">https:\/\/www.engadget.com\/microsofts-ai-tool-can-turn-photos-into-realistic-videos-of-people-talking-and-singing-070052240.html?guccounter=1<\/a><\/li>\n<\/ul>\n\n\n\n<p><\/p>\n","protected":false},"excerpt":{"rendered":"<p>Microsoft has released a fascinating new framework for generating lifelike talking faces called VASA-1. Given a single static image and a speech audio clip, VASA-1 is capable of producing lip movements that are synchronized with the audio and capture a large spectrum of facial nuances and natural head motions. See more here, read the paper here and here. Getting worried you&#8217;ll be replaced by AI yet? 
If this gets perfected (it&#8217;s not perfect yet, but the results get better and&#8230;<\/p>\n<p class=\"read-more\"><a class=\"btn btn-default\" href=\"https:\/\/mattfife.com\/?p=11239\"> Read More<span class=\"screen-reader-text\">  Read More<\/span><\/a><\/p>\n","protected":false},"author":2,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"jetpack_post_was_ever_published":false,"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_publicize_message":"","jetpack_publicize_feature_enabled":true,"jetpack_social_post_already_shared":true,"jetpack_social_options":{"image_generator_settings":{"template":"highway","default_image_id":0,"font":"","enabled":false},"version":2}},"categories":[28,9,14],"tags":[],"class_list":["post-11239","post","type-post","status-publish","format-standard","hentry","category-ai","category-cool","category-photography"],"jetpack_publicize_connections":[],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p4WECr-2Vh","jetpack-related-posts":[],"_links":{"self":[{"href":"https:\/\/mattfife.com\/index.php?rest_route=\/wp\/v2\/posts\/11239","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/mattfife.com\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/mattfife.com\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/mattfife.com\/index.php?rest_route=\/wp\/v2\/users\/2"}],"replies":[{"embeddable":true,"href":"https:\/\/mattfife.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=11239"}],"version-history":[{"count":4,"href":"https:\/\/mattfife.com\/index.php?rest_route=\/wp\/v2\/posts\/11239\/revisions"}],"predecessor-version":[{"id":12215,"href":"https:\/\/mattfife.c
om\/index.php?rest_route=\/wp\/v2\/posts\/11239\/revisions\/12215"}],"wp:attachment":[{"href":"https:\/\/mattfife.com\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=11239"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/mattfife.com\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=11239"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/mattfife.com\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=11239"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}