Using artificial intelligence to automate content moderation and compliance with AWS services
Using artificial intelligence to automate content moderation and compliance with AWS services
Customers are learning that content moderation processes by humans alone cannot scale to meet safety, regulatory, and operational needs in the user-generated content era. Artificial intelligence can help gaming, social media, e-commerce, and advertising organizations moderate the deluge of content to reclaim up to 95% of the time their teams spend moderating content manually. Financial, healthcare, and education organizations can streamline the detection and protection of personally identifiable information (PII) across environments and processes.
Learning Objectives: * Objective 1: Scale to moderate high volumes of User Generated Content (UGC) efficiently. * Objective 2: Save content moderation time and costs. * Objective 3: Get started to streamline your content moderation processes.
☁️ AWS Online Tech Talks cover a wide range of topics and expertise levels through technical deep dives, demos, customer examples, and live Q\u0026A with AWS experts. Builders can choose from bite-sized 15-minute sessions, insightful fireside chats, immersive virtual workshops, interactive office hours, or watch on-demand tech talks at your own pace. Join us to fuel your learning journey with AWS.
#AWS
Content
2.14 -> [Music]
7.279 -> hi my name is john rouse i'm the
9.2 -> worldwide business development manager
11.36 -> for aws and joining me for today's
14.16 -> webinar using ai to automate content
16.96 -> moderation and compliance with aws
19.68 -> services is nate bochmeier senior
22.16 -> solution architect from aws
24.56 -> as well as ori milinova director of ai
27.119 -> and data science software
30 -> today we're going to talk about creating
31.679 -> safe online environments protect your
33.76 -> brand and minimize moderation costs
37.36 -> first we're going to take a look at user
38.96 -> generated content
41.44 -> then nate is going to show
43.44 -> how to bring in aws ai services together
46.8 -> with a demo
48.879 -> yuri our partner from software is going
50.559 -> to come and talk about how they can help
52.399 -> customers that have content moderation
54.399 -> needs
55.6 -> and then
56.64 -> we'll explain how you get started what
58.32 -> are the next steps
60.079 -> during this session please feel free to
62 -> send any questions during the
63.6 -> presentation as we have experts in the
65.6 -> background ready to help so let's jump
68.159 -> in
70.88 -> modern web and mobile platforms fuel
73.68 -> businesses and drive user engagement
75.92 -> through social features
78.32 -> the daily volume of user-generated
80.32 -> content ugc
82.799 -> and third-party content has been
84.799 -> increasing substantially in industries
88.08 -> such as social media
90 -> social gaming
91.68 -> online forums dating matrimonial and
95.52 -> photo sharing websites
98.479 -> in turn customers need to review audio
101.759 -> image video and text content to ensure
105.92 -> that their end users are not exposed to
107.84 -> potentially inappropriate or offensive
110.479 -> material
111.68 -> such as profanity
113.36 -> violence
114.64 -> drug use adult products nudity or
117.68 -> disturbing content
120.399 -> more than 86 percent of companies today
123.68 -> use ugc as part of their marketing
126.88 -> strategy
132.16 -> this diagram highlights that most modern
135.2 -> businesses are heavily dependent on ugc
139.2 -> and that ugc's growth is outpacing human
142.8 -> capacity
144 -> making content moderation inefficient
146.879 -> unnecessary
148.319 -> risky and expensive
151.92 -> the ugc platform industry is growing at
154.72 -> 26 percent
156.56 -> compounded annually growth rate
158.879 -> and is expected to reach 10 billion
161.84 -> by 2028.
164.239 -> 79 of consumers purchase decisions are
167.12 -> influenced by ugc
169.28 -> and 80 of all web content is ugc
173.519 -> from images on social media to reviews
175.84 -> of products consumers are dominating the
178.56 -> online space
184.4 -> from startups to enterprise customers
187.12 -> need to ensure that their end users are
189.599 -> not exposed to potentially inappropriate
192.56 -> or offensive material or disturbing
194.48 -> content
196.08 -> content moderation is fundamental in
198.56 -> protecting online communities their
201.04 -> members and members personal information
205.04 -> there are strong business reasons to
206.879 -> reconsider how your organization
209.68 -> moderates content
217.36 -> online community members expect safe and
220.4 -> inclu inclusive
222.319 -> experiences where they can freely
224.319 -> consume and contribute images video text
227.84 -> and audio
229.2 -> the ever increasing volume
231.36 -> variety and complexity of ugc
234.64 -> makes traditional human moderation
236.56 -> workflows challenging to scale to
238.959 -> protect users
241.92 -> these limitations force customers into
245.12 -> inefficient expensive
247.439 -> reactive
248.56 -> mitigation processes
250.72 -> that carry an unnecessary risk of users
254.4 -> and the business
256.479 -> the result is a poor harmful and
259.519 -> non-inclusive community experience
262.32 -> that disengages users
264.4 -> negatively impacts community
266.72 -> and business objectives
270.16 -> some statistics
271.6 -> about user generated content
273.919 -> over fifty percent of people said they
276.16 -> create content at least once daily
279.04 -> with twenty three percent saying that
280.8 -> they create more frequently two to five
283.36 -> times per day
285.44 -> more than 40 40 percent of respondents
288.24 -> will disengage from a brand's community
290.8 -> after as little as one exposure to toxic
294.08 -> or fake ugc
296.4 -> while 45 say they will lose all trust in
299.6 -> a brand
301.919 -> 70 of survey respondents stated that
304.4 -> brands need to protect users from toxic
306.8 -> content and 78
309.12 -> said that it's a brand's responsibility
311.44 -> to provide
312.72 -> positive and welcoming online experience
322.24 -> what are customers doing to cope with
324.08 -> cost volume and complexity tied to
327.28 -> moderation
328.88 -> they're turning to ai
330.88 -> ml and other technologies such as deep
333.84 -> learning dl
335.52 -> and natural language processing nlp
338.8 -> to cope with the increase in volume
341.36 -> complexity and the cost to moderate
344.24 -> content accurately and efficiently
349.039 -> customers in modern industries such as
351.199 -> social media fantasy sports and gaming
354 -> and others on traditional verticals
356.88 -> such as financial services and health
358.88 -> care
360.56 -> and facing the same needs and issues
363.44 -> the goal is to improve safe online
365.84 -> environment and reduce moderation costs
376.08 -> gone are the days where consumers
378 -> blindly accept content the way it's been
380.319 -> traditionally served
382.56 -> customers need to ensure that their end
385.28 -> users are not exposed to potentially
388.24 -> inappropriate
389.6 -> or offensive material or disturbing
392.08 -> content
394.08 -> the solution is
395.759 -> scalable content moderation workflows
399.52 -> that rely on artificial intelligence ai
402.96 -> and machine learning ml and deep
404.96 -> learning dl
406.639 -> in natural language processing and lp
408.88 -> technology
412.08 -> these constructs translate transcribe
414.88 -> recognize detect mask redact and
418.56 -> strategically bring human talent into
420.96 -> moderation workflow
423.36 -> to run the actions needed to keep users
425.919 -> safe and engage while increasing
428.08 -> accuracy
429.36 -> and process efficiently in lowering
432.16 -> operational costs
439.05 -> [Music]
440.8 -> aws content moderation ai services can
443.919 -> be leveraged to streamline and automate
447.52 -> your moderation workflows and lower
450 -> operational costs
451.599 -> in the process
454 -> customers can fully manage image video
457.28 -> text
458.319 -> and speech moderation apis
461.199 -> to proact to proactively detect
463.84 -> inappropriate
465.36 -> unwanted or offensive content at scale
468.96 -> and increase brand safety for you and
470.879 -> your partners
473.68 -> image moderation you can detect explicit
475.919 -> adult suggestive content in both image
478.479 -> and videos
480.08 -> video moderation labels are organized in
482.56 -> a hierarchical taxonomy
484.879 -> that provides both top level categories
487.44 -> such as suggestive
489.12 -> and nuanced second
490.96 -> level categories that identify the
493.759 -> specific types of content such as female
496.8 -> swimwear
498.56 -> or partial nudity using this information
501.36 -> you can create granular business rules
503.599 -> for different geographies
505.52 -> target audiences time of day and so on
509.759 -> text detection can be used for image and
511.84 -> video to read text and then check it
513.839 -> against your own list
515.519 -> of prohibited words or phrases
518.56 -> if you want to further analyze text you
521.279 -> can use nlp
525.2 -> audio moderation allows you to detect
527.279 -> profanities or hate speech in videos you
530.16 -> can convert speech to text and then
532.64 -> check it against similar lists
540.16 -> startups social media gaming
543.12 -> and other industries must ensure their
545.44 -> customers collaborate responsibly while
548.48 -> keeping operational costs down
551.2 -> businesses in the broadcasting and media
553.519 -> industries often find it difficult to
556.08 -> efficiently add ratings to content
558.24 -> pieces and formats to comply
561.04 -> with guidelines for different markets
563.04 -> and audiences
564.72 -> other organizations in financial and
566.959 -> health care services
569.2 -> find it challenging to protect personal
571.12 -> identifiable and health information pii
574.32 -> and phi across internal and external
577.44 -> environments and processes
580.24 -> addressing your content moderation needs
582.08 -> require a combination of computer vision
584.56 -> cv and text and language
587.279 -> transform and other ai and ml
589.36 -> capabilities to efficiently moderate the
592.48 -> increasing flux of user-generated
594.88 -> content and sensitive information
598.64 -> aw aws is ai services for moderation and
602.56 -> contextual insight and human in the loop
605.44 -> moderation
606.72 -> can be leveraged to streamline and
608.88 -> automate your image and video moderation
611.2 -> workflows
613.04 -> moderation across one two or more media
616.48 -> types
617.519 -> and data sources plus the addition of ai
620.56 -> insights such as sentiment
622.959 -> contacts and human reviews to either
626 -> guarantee
627.44 -> very high levels of accuracy and to
630.32 -> continue polishing your prediction
632.48 -> models
633.519 -> gives you a complete solution to solve
636.16 -> the most pressing
637.68 -> needs of the fastest growing ugc and
640.56 -> compliance use cases
643.839 -> adding ai insights and connections
646 -> moderation workflows with additional
648.399 -> contextual information
650.32 -> human reviews
651.92 -> smaller sets of moderators to correct
654.56 -> predictions or comply with specific
656.959 -> businesses
658.48 -> so let's start with the social media use
661.36 -> case
662.91 -> [Music]
666.24 -> social media with content moderation
668.24 -> prevents your user exposure to
670.079 -> inappropriate content on photo and video
672.8 -> sharing platforms
674.48 -> such as gaming communities and data
676.88 -> dating applications
678.8 -> these protections increase community
681.04 -> growth
682.399 -> session length conversion metrics and
685.36 -> other responsible social media
687.68 -> objectives and network matrix
691.92 -> this ensures that the content is not
694 -> just appropriate
695.519 -> it also
697.12 -> protects your audience from possible
699.12 -> bullying
700.079 -> or trolling by some
702.399 -> irrational users
704.399 -> but also aligns with your branding and
706.72 -> helps you achieve your overall business
709.92 -> goals
715.36 -> coffee meets bagel is a dating
717.519 -> application that serves potential
719.279 -> matches to over 1.5 million users daily
723.12 -> their motto is quality over quantity
726 -> because they focus on bringing a fun
728.32 -> safe and quality dating experience that
730.959 -> results in meaningful relationships
733.68 -> to deliver on these promises every match
735.92 -> they serve has to fulfill a strict set
738.72 -> of criterias that their users request
743.04 -> coffee meets bagels solution identifies
745.76 -> as user-generated photos that need
749.04 -> moderation and automatically reviews
751.44 -> photo libraries of any scale to detect
754.32 -> unsafe content and apply rules to meet
757.44 -> geographic requirements
760.639 -> they were able to lower moderation costs
762.959 -> by 72 percent
764.8 -> through 97 percent less human
767.12 -> involvement in moderation processes
769.92 -> and lower time for photo approvals from
772.8 -> hours to minutes
777.92 -> like with most communities
780.72 -> gaming companies are developing positive
783.04 -> play guidelines to help make sure your
785.76 -> games and services
788.079 -> are an enjoyable experience for all
790.48 -> players
792.16 -> prevent inappropriate content such as
794.24 -> hate speech profanity or bullying within
797.76 -> a game chat
799.2 -> additionally
800.56 -> moderating user generated values such as
803.2 -> nicknames and profiles
805.04 -> keeps gamers engaged and active and
807.12 -> without motive to leave the game's
809.44 -> ecosystem
812 -> whether you're new to gaming or have
813.76 -> been an active player for years
815.92 -> gaming companies need your help to make
818.24 -> this a community where we all want to be
821.04 -> part of
828.24 -> social is a leading mobile software
830.56 -> company focused on building social
833.279 -> networking and gaming apps
835.36 -> the company develops
837.6 -> omelette arcade a global community where
840.639 -> tens of millions of mobile gaming live
843.36 -> streamers and esport players gather to
846.32 -> share
847.199 -> game play and meet new friends
851.279 -> mobi social solution uses amazon
853.839 -> recognition moderation api
857.6 -> and they use a two-level hierarchical
860.399 -> taxonomy to label categories of
862.72 -> inappropriate or offensive content
866.88 -> they were able to reduce manual content
869.519 -> moderation by 95 percent
872.399 -> increase accuracy and scalability of
874.639 -> operations
875.92 -> and enabling their engineering resources
877.839 -> to focus on
879.68 -> core business areas
887.04 -> content moderation is a process of
888.8 -> monitoring user generated content such
891.279 -> as ecommerce sites
893.76 -> keep out illegal or controversial
896.24 -> product listings that violate compliance
898.959 -> policies that could incur both liability
902.32 -> and buyers and sellers churn
905.199 -> it is the appropriate way to safeguard
907.519 -> the brand's image and manage unwanted
910.24 -> content
912.079 -> in simple terms it's the process of
914.079 -> reviewing
915.199 -> filtering the social media content which
918.079 -> is in form
919.839 -> of con comments images and etc
923.68 -> so you ingest product descriptions and
925.68 -> listings from multiple markets you
927.6 -> detect content types you moderate
930.24 -> through pre-trained or custom models
932.8 -> examples would be in the real estate
934.88 -> auto or auction platforms
938.48 -> within content moderation the values are
940.56 -> it improves search engines ratings
943.759 -> proper content moderation improves your
945.6 -> search rankings organically and helps
947.6 -> businesses to flourish and get a better
950.16 -> online presence
952.48 -> increase brand reputation
954.8 -> as well as help to understand target
956.959 -> audience and scale promotion campaigns
963.839 -> 11th street is an online shopping
966.399 -> company
967.519 -> they're using content moderation to
969.839 -> automate the review of images and videos
973.36 -> as part of 11th street's interactive
975.759 -> experience and to empower their
977.839 -> community to express themselves
980.639 -> they have a feature where users can
982.639 -> submit a photo or video review of the
985.44 -> product that they have just purchased
987.92 -> example wearing
989.519 -> the new makeup
991.92 -> to make sure that no images or videos
993.839 -> contain content that is prohibited by
996.24 -> their platform guidelines they
998.16 -> originally resorted to a manual content
1000.24 -> moderation
1001.44 -> they quickly found that this was costly
1003.839 -> error prone and not scalable
1006.56 -> eleventh street solution now uses
1009.279 -> amazon recognition moderation labels and
1012.399 -> video apis
1014.72 -> to automate moderation of thousands of
1017.12 -> customer photo and video
1019.519 -> reviews daily
1022 -> 11th street now reviews more than 7 000
1025.12 -> images and videos every day
1027.52 -> with higher quality
1029.28 -> speed and at a lower cost when compared
1032.079 -> to its initial manual moderation
1034.319 -> approach
1039.28 -> within advertising
1042 -> content moderation helps avoid
1044.079 -> associations that increases the risk of
1047.12 -> public backlash due to unwanted
1049.84 -> association between your brand
1052.4 -> and ad
1053.44 -> or content within ads
1056.24 -> brand safety has become a major issue in
1058.96 -> the online media
1060.799 -> industry
1061.919 -> worldwide
1064.16 -> companies who value their brand should
1066.24 -> be concerned about what web pages their
1068.799 -> ads appear on
1070.32 -> and then next
1071.919 -> to what kind of content many advertisers
1074.64 -> assume that their brands are safe if
1076.48 -> they're running their online campaigns
1078.799 -> with a major media agency
1080.96 -> a major media exchange or platform
1084.32 -> or major network
1086.96 -> in fact brand safety should never be
1088.799 -> assumed many major media agencies place
1092.32 -> their advertiser ads and very bad media
1095.28 -> placement that can harm brand image and
1098.08 -> in some cases can be potentially lead to
1100.799 -> legal action
1107.76 -> flipboard is one of the world's first
1110.559 -> social magazines
1112.559 -> inspired by the beauty and ease of print
1115.52 -> media
1116.559 -> the company's mission is to
1118 -> fundamentally improve how people
1120.24 -> discover view and share content across
1123.36 -> their social networks
1126.16 -> flipboard is a content recommendation
1128.72 -> platform that enables publisher
1131.44 -> creators and curators to share stories
1134.32 -> with readers to help them stay up to
1136.32 -> date on their passions and interests
1139.679 -> on average flipboard processes
1141.52 -> approximately 90 million images per day
1144.799 -> to maintain a safe and inclusive
1146.72 -> environment and to confirm that all
1148.96 -> images comply with platform guidelines
1151.28 -> at scale
1152.72 -> it is crucial to implement a content
1155.12 -> moderation workflow using machine
1157.2 -> learning
1159.2 -> however building models for this
1162.08 -> for this system internally was labor
1164.88 -> intensive and lack the accuracy
1166.72 -> necessary to meet the high quality
1168.72 -> standards
1169.84 -> flipboard's users expect
1172.4 -> this is where amazon recognition became
1175.039 -> the right solution
1176.72 -> for their product
1183.84 -> for the highly regulated industries
1186.48 -> financial services there's a need to
1188.799 -> detect
1189.919 -> and redact
1191.28 -> pii to ensure that sensitive user data
1194.96 -> remains private
1196.64 -> your customers can trust your platform
1199.039 -> and increase participation
1201.36 -> investment and referrals
1204.72 -> healthcare there is a need to detect and
1207.039 -> redact pi phi
1209.679 -> and other sensitive information to
1211.44 -> ensure that data remains private
1214.159 -> healthcare providers can remain
1216.24 -> compliant with hipaa and other
1218.799 -> regulatories to avoid fines
1222.24 -> legal brief management
1224.08 -> automate extraction of insight from
1226.24 -> packets on legal briefs such as
1228.64 -> contracts and court records
1231.919 -> further secure your documents by
1233.679 -> identifying and redacting personally
1236.08 -> identify a personally identifiable
1239.679 -> information pii
1242.4 -> process financial documents classify and
1245.2 -> extract entities
1246.799 -> from financial services
1248.799 -> documents such as insurance claims or
1251.28 -> mortgage packages or fine relationships
1254.4 -> between financial events and a financial
1257.84 -> article
1259.36 -> and an example would be to analyze
1261.52 -> support tickets and knowledge articles
1264 -> to detect
1265.44 -> pii entities and redact the text from
1268.88 -> your index
1270.72 -> the documents and search solutions after
1273.36 -> that search solutions are free of pii
1275.919 -> entities and documents
1277.76 -> redacting pii entities help you protect
1280.88 -> privacy and comply with the local laws
1283.6 -> and regulations
1293.919 -> pterodact solutions
1295.919 -> software offers a robust alternative to
1298.64 -> secure information
1300.4 -> sharing in a world of ever-increasing
1302.84 -> compliance and privacy concerns
1306 -> with its signature information
1309.679 -> and presentations
1311.44 -> and capabilities
1313.039 -> pteradac's tools provide the users with
1315.2 -> a safe
1316.559 -> information and sharing
1319.039 -> environment
1326.72 -> addressing your content moderation needs
1328.799 -> requires a combination of computer
1330.88 -> vision text and language
1333.44 -> transform and other aiml capabilities to
1336.559 -> efficiently moderate the increasing
1339.12 -> influx of user-generated content and
1341.6 -> sensitive information
1343.76 -> aws is ai services amazon recognition
1347.52 -> amazon transcribe
1349.36 -> amazon comprehends and amazon translate
1353.039 -> can be leveraged to streamline and
1354.799 -> automate your image and video moderation
1356.72 -> workflows
1358.96 -> i'd like to hand it over to nate to talk
1360.88 -> more about these services and solutions
1364.88 -> let's examine how you can implement low
1366.96 -> code workflows using aws ai services and
1370.48 -> serverless technologies
1375.12 -> you'll need scalable content moderation
1377.039 -> workflows that rely on artificial
1378.88 -> intelligence machine learning deep
1381.12 -> learning and natural language processing