I-Amazon Bedrock inikeza uhla olubanzi lwamamodeli esisekelo asebenza kahle kakhulu avela e-Amazon nezinye izinkampani ezihamba phambili ze-AI, okuhlanganisa I-Anthropic, I-AI21, Meta, Cohere, Futhi Ukuzinza kwe-AI, futhi ihlanganisa izimo ezibanzi zokusebenzisa, okuhlanganisa ukukhiqizwa kombhalo nezithombe, ukusesha, ukuxoxa, ukucabanga kanye nama-agent, nokunye. Okusha I-Amazon Titan Image Generator imodeli ivumela abadali bokuqukethwe ukuthi bakhiqize ngokushesha izithombe zekhwalithi ephezulu, ezingokoqobo besebenzisa ukwaziswa kombhalo wesiNgisi olula. Imodeli ye-AI ethuthukisiwe iqonda imiyalo eyinkimbinkimbi enezinto eziningi futhi ibuyisela izithombe zekhwalithi yesitudiyo ezifanele ukukhangisa, i-ecommerce, kanye ukuzijabulisa. Izici ezibalulekile zifaka ikhono lokwenza ngcono izithombe ngokuphindaphinda ekwazisweni, ukuhlela okungemuva okuzenzakalelayo, nokukhiqiza ukuhlukahluka okuningi kwesigcawu esifanayo. Abadali bangakwazi futhi ukwenza imodeli ngendlela oyifisayo ngedatha yabo ukuze bakhiphe izithombe zomkhiqizo ngesitayela esithile. Okubalulekile, i-Titan Image Generator inezivikelo ezakhelwe ngaphakathi, njengama-watermark angabonakali kuzo zonke izithombe ezikhiqizwe yi-AI, ukukhuthaza ukusetshenziswa ngendlela efanele futhi unciphise ukusatshalaliswa kolwazi oluyi-disinformation. Lobu buchwepheshe obusha benza ukukhiqiza izithombe zangokwezifiso ngevolumu enkulu noma iyiphi imboni kufinyeleleke kakhudlwana nangempumelelo.
The new I-Amazon Titan Multimodal Embeddings imodeli isiza ukwakha usesho olunembe kakhudlwana nezincomo ngokuqonda umbhalo, izithombe, noma kokubili. Iguqula izithombe nombhalo wesiNgisi kube ama-semantic vectors, ithwebula incazelo nobudlelwano kudatha yakho. Ungakwazi ukuhlanganisa umbhalo nezithombe njengezincazelo zomkhiqizo nezithombe ukuze uhlonze izinto ngempumelelo kakhudlwana. Ama-vector anika amandla umuzwa wokusesha osheshayo, onembile. Ukushumeka kwe-Titan Multimodal kuyavumelana nezimo kubukhulu be-vector, okunika amandla ukulungiselelwa kwezidingo zokusebenza. I-asynchronous API kanye Isevisi ye-Amazon OpenSearch Isixhumi sikwenza kube lula ukuhlanganisa imodeli kuzinhlelo zakho zokusebenza zosesho lwe-neural.
Kulokhu okuthunyelwe, sihamba ngendlela yokusebenzisa i-Titan Image Generator kanye namamodeli we-Titan Multimodal Embeddings nge-AWS Python SDK.
Ukukhiqiza nokuhlela isithombe
Kulesi sigaba, sibonisa amaphethini ekhodi ayisisekelo okusebenzisa i-AWS SDK ukuze ukhiqize izithombe ezintsha futhi wenze ukuhlela okunamandla e-AI ezithombeni ezikhona. Izibonelo zekhodi zinikezwe kuPython, futhi iJavaScript (Node.js) nayo iyatholakala kulokhu IGitHub repository.
Ngaphambi kokuthi ubhale izikripthi ezisebenzisa i-Amazon Bedrock API, udinga ukufaka inguqulo efanele ye-AWS SDK endaweni yangakini. Ngemibhalo yePython, ungasebenzisa i I-AWS SDK yePython (Boto3). Abasebenzisi bePython bangase futhi bafune ukufaka ifayela le- Imojula yomcamelo, esiza ukusebenza kwesithombe njengokulayisha nokugcina izithombe. Ukuze uthole imiyalelo yokusetha, bheka ku- IGitHub repository.
Ukwengeza, vumela ukufinyelela ku-Amazon Titan Image Generator kanye namamodeli we-Titan Multimodal Embeddings. Ukuze uthole ukwaziswa okwengeziwe, bheka Ukufinyelela imodeli.
Imisebenzi yomsizi
Umsebenzi olandelayo usetha iklayenti lesikhathi sokusebenza se-Amazon Bedrock Boto3 futhi ukhiqize izithombe ngokuthatha imithwalo yemisebenzi ehlukahlukene (esixoxa ngayo kamuva kulokhu okuthunyelwe):
Khiqiza izithombe ngombhalo
Imibhalo ekhiqiza isithombe esisha ekwazisweni kombhalo ilandela le phethini yokuqalisa:
- Lungiselela ukwaziswa kombhalo kanye nokwaziswa kombhalo ongakhetha kukho okunegethivu.
- Sebenzisa le
BedrockRuntime
iklayenti ukunxenxa imodeli ye-Titan Image Generator. - Hlaziya futhi unqume impendulo.
- Londoloza izithombe eziwumphumela kudiski.
Umbhalo ukuya esithombeni
Okulandelayo iskripthi esijwayelekile sokukhiqiza isithombe semodeli ye-Titan Image Generator:
Lokhu kuzokhiqiza izithombe ezifana nalezi ezilandelayo.
Isithombe sempendulo 1 | Isithombe sempendulo 2 |
Okuhlukile kwesithombe
Ukuhluka kwesithombe kunikeza indlela yokwenza okuhlukile okucashile kwesithombe esikhona. Amazwibela ekhodi alandelayo asebenzisa esinye sezithombe ezikhiqizwe esibonelweni sangaphambilini ukudala izithombe ezihlukile:
Lokhu kuzokhiqiza izithombe ezifana nalezi ezilandelayo.
Isithombe sangempela | Isithombe sempendulo 1 | Isithombe sempendulo 2 |
Hlela isithombe esikhona
Imodeli ye-Titan Image Generator ikuvumela ukuthi wengeze, ususe, noma ushintshe ama-elementi noma izindawo ngaphakathi kwesithombe esikhona. Ucacisa ukuthi iyiphi indawo okufanele uyithinte ngokunikeza okukodwa kokulandelayo:
- Isithombe semaski – Isithombe semaski siyisithombe esinambambili lapho amaphikseli enani elingu-0 amele indawo ofuna ukuyithinta kanye namaphikseli enani angu-255 amele indawo okufanele ihlale ingashintshiwe.
- Imaski ngokushesha - Ukwaziswa kwemaski kuyincazelo yombhalo wemvelo yolimi lwezakhi ofuna ukuzithinta, esebenzisa imodeli yangaphakathi yokuhlukanisa umbhalo ube izingxenye.
Ukuze uthole ukwaziswa okwengeziwe, bheka Imihlahlandlela yobunjiniyela esheshayo.
Imibhalo esebenza ngokuhlelwa esithombeni ilandela le phethini yokusetshenziswa:
- Layisha isithombe esizohlelwa kudiski.
- Guqula isithombe sibe iyunithi yezinhlamvu enekhodi engu-base64.
- Lungiselela imaski ngokusebenzisa enye yezindlela ezilandelayo:
- Layisha isithombe semaski kusuka kudiski, usifake njenge-base64 bese usibeka njengefayela le-
maskImage
ipharamitha. - Setha
maskText
ipharamitha encazelweni yombhalo yezinto ezizothinta.
- Layisha isithombe semaski kusuka kudiski, usifake njenge-base64 bese usibeka njengefayela le-
- Cacisa okuqukethwe okusha okuzokwenziwa kusetshenziswa enye yezinketho ezilandelayo:
- Ukwengeza noma ukushintsha i-elementi, setha i-
text
ipharamitha encazelweni yokuqukethwe okusha. - Ukuze ususe i-elementi, shiya i-
text
ipharamitha ngokuphelele.
- Ukwengeza noma ukushintsha i-elementi, setha i-
- Sebenzisa le
BedrockRuntime
iklayenti ukunxenxa imodeli ye-Titan Image Generator. - Hlaziya futhi unqume impendulo.
- Londoloza izithombe eziwumphumela kudiski.
Ukuhlela into: Ukupenda ngesithombe semaski
Okulandelayo iskripthi esijwayelekile sokuhlela isithombe semodeli ye-Titan Image Generator esetshenziswa maskImage
. Sithatha esinye sezithombe ezikhiqizwe ngaphambilini futhi sinikeze isithombe semaski, lapho amaphikseli enani elingu-0 ahunyushwa ngokuthi amaphikseli amnyama namaphikiseli angu-255 njengokumhlophe. Siphinde simiselele enye yezinja esithombeni ikati sisebenzisa ukwaziswa kombhalo.
Lokhu kuzokhiqiza izithombe ezifana nalezi ezilandelayo.
Isithombe sangempela | Isithombe Semaski | Isithombe Esihleliwe |
Ukususwa kwento: Ukupenda ngomyalo wemaski
Kwesinye isibonelo, sisebenzisa maskPrompt
ukucacisa into esesithombeni, ethathwe ezinyathelweni zangaphambili, ukuze ihlelwe. Ngokukhipha ukwaziswa kombhalo, into izosuswa:
Lokhu kuzokhiqiza izithombe ezifana nalezi ezilandelayo.
Isithombe sangempela | Isithombe sokuphendula |
Ukuhlela ingemuva: Ukupenda ngaphandle
Ukupenda ngaphandle kuyasiza uma ufuna ukushintsha ingemuva lesithombe. Ungakwazi futhi ukunweba imingcele yesithombe ukuze uthole umphumela wokuhlehlisa. Kuskripthi sesibonelo esilandelayo, sisebenzisa maskPrompt
ukucacisa ukuthi iyiphi into okufanele igcinwe; ungasebenzisa futhi maskImage
. Ipharamitha outPaintingMode
icacisa ukuthi kuvunyelwe yini ukuguqulwa kwamaphikseli ngaphakathi kwemaski. Uma isethwe njenge DEFAULT
, amaphikseli angaphakathi kwemaski avunyelwe ukuthi ashintshwe ukuze isithombe esakhiwe kabusha silingane sisonke. Le nketho iyanconywa uma i maskImage
enikeziwe ayimeli into enezinga lephikseli ngokunemba. Uma isethwe njenge PRECISE
, ukuguqulwa kwamaphikseli ngaphakathi kwemaski kuyavinjelwa. Le nketho iyanconywa uma usebenzisa i-a maskPrompt
noma maskImage
elimele into enezinga le-pixel ngokunemba.
Lokhu kuzokhiqiza izithombe ezifana nalezi ezilandelayo.
Isithombe sangempela | Umbhalo | Isithombe sokuphendula |
“ibhishi” | ||
"ihlathi" |
Ngaphezu kwalokho, imiphumela yamanani ahlukene we outPaintingMode
, Nge maskImage
ezingabonisi into ngokunemba kwezinga le-pixel, zimi kanje.
Lesi sigaba sikunikeze isifinyezo semisebenzi ongayenza ngemodeli ye-Titan Image Generator. Ngokukhethekile, lezi zikripthi zibonisa umbhalo uye esithombeni, ukuhluka kwesithombe, ukupenda, kanye nemisebenzi yokupenda ngaphandle. Kufanele ukwazi ukulungisa amaphethini ezinhlelo zakho zokusebenza ngokubhekisela imininingwane yepharamitha yalezo zinhlobo zomsebenzi ezinemininingwane ku. Imibhalo ye-Amazon Titan Image Generator.
Ukushumeka kwe-Multimodal nokusesha
Ungasebenzisa imodeli ye-Amazon Titan Multimodal Embeddings ngemisebenzi yebhizinisi efana nosesho lwezithombe nesincomo esisekelwe kokufana, futhi inokunciphisa okwakhelwe ngaphakathi okusiza ukunciphisa ukuchema emiphumeleni yokusesha. Kukhona osayizi abaningi bokushumeka bobukhulu bokuhwebelana okungcono kakhulu kokubambezeleka/ukunemba kwezidingo ezihlukene, futhi konke kungenziwa ngendlela oyifisayo nge-API elula ukuze ivumelane nedatha yakho kuyilapho uphikelela ekuvikelekeni kwedatha nobumfihlo. I-Amazon Titan Multimodal Embeddings inikezwa njengama-API alula wesikhathi sangempela noma i-asynchronous batch yokuguqula ukusesha kanye nezinhlelo zokusebenza zokuncoma, futhi ingaxhunywa kumininingwane egciniwe ye-vector ehlukene, okuhlanganisa. Isevisi ye-Amazon OpenSearch.
Imisebenzi yomsizi
Umsebenzi olandelayo uguqula isithombe, bese ubhala ngokuzikhethela, ube ukushumeka kwezindlela eziningi:
Umsebenzi olandelayo ubuyisela okushumekiwe okuphezulu okufanayo kwe-multimodal uma kunikezwe umbuzo wokushumekwa kwe-multimodal. Qaphela ukuthi ngokusebenza, ungasebenzisa isizindalwazi se-vector esiphethwe, njenge-OpenSearch Service. Isibonelo esilandelayo ngesezinjongo zemifanekiso:
Isethi yedatha yokwenziwa
Ngezinjongo zemifanekiso, sisebenzisa Imodeli ka-Anthropic ka-Claude 2.1 ku-Amazon Bedrock ukukhiqiza ngokungahleliwe imikhiqizo eyisikhombisa eyahlukene, ngayinye inezinhlobonhlobo ezintathu, usebenzisa umyalo olandelayo:
Generate a list of 7 items description for an online e-commerce shop, each comes with 3 variants of color or type. All with separate full sentence description.
Okulandelayo uhlu lwemiphumela ebuyisiwe:
Yabela impendulo engenhla kokuguquguqukayo response_cat
. Bese sisebenzisa imodeli ye-Titan Image Generator ukuze sakhe izithombe zomkhiqizo wento ngayinye:
Zonke izithombe ezikhiqiziwe zingatholakala ku-appendix ekupheleni kwalokhu okuthunyelwe.
Inkomba yesethi yedatha ye-Multimodal
Sebenzisa ikhodi elandelayo ukuze uthole inkomba yedathasethi ye-multimodal:
Ukusesha kwe-Multimodal
Sebenzisa ikhodi elandelayo ukuze useshe izindlela eziningi:
Okulandelayo eminye imiphumela yosesho.
Isiphetho
Okuthunyelwe kwethula i-Amazon Titan Image Generator kanye namamodeli we-Amazon Titan Multimodal Embeddings. I-Titan Image Generator ikuvumela ukuthi udale ngokwezifiso, izithombe zekhwalithi ephezulu kusuka emiyalweni yombhalo. Izici ezibalulekile zifaka ukuphindaphinda ekwazisweni, ukuhlela okungemuva okuzenzakalelayo, nokwenza ngokwezifiso idatha. Inezivikelo ezifana nama-watermark angabonakali ukukhuthaza ukusetshenziswa okufanele. Ukushumeka kwe-Titan Multimodal kuguqula umbhalo, izithombe, noma kokubili kube ama-semantic vectors ukuze kunikwe amandla ukusesha okunembile nezincomo. Sibe sesihlinzeka ngamasampula ekhodi ye-Python ukuze sisebenzise lezi zinsizakalo, futhi sabonisa ukukhiqiza izithombe kusuka emiyalweni yombhalo kanye nokuphindaphinda kulezo zithombe; ukuhlela izithombe ezikhona ngokungeza, ukususa, noma ukufaka esikhundleni sezakhi ezicaciswe yizithombe zemaski noma umbhalo wemaski; ukudala ukushumeka kwe-multimodal kusuka kumbhalo, izithombe, noma kokubili; kanye nokusesha okufanayo okushumekiwe kwe-multimodal embuzweni. Siphinde sabonisa sisebenzisa idathasethi yokwenziwa ye-e-commerce ekhonjwe futhi yaseshwa kusetshenziswa Ukushumeka kwe-Titan Multimodal. Inhloso yalokhu okuthunyelwe ukunika amandla onjiniyela ukuthi baqale ukusebenzisa lezi zinsizakalo ezintsha ze-AI ezinhlelweni zabo zokusebenza. Amaphethini ekhodi angasebenza njengezifanekiso zokusetshenziswa ngokwezifiso.
Wonke amakhodi ayatholakala ku- IGitHub repository. Ukuze uthole olunye ulwazi, bheka ku- I-Amazon Bedrock User Guide.
Mayelana Ababhali
Rohit Mittal unguMphathi Womkhiqizo Oyinhloko e-Amazon AI eyakha amamodeli esisekelo esinezimo eziningi. Usanda kuhola ukwethulwa kwemodeli ye-Amazon Titan Image Generator njengengxenye yenkonzo ye-Amazon Bedrock. Unolwazi ku-AI/ML, NLP, kanye Nosesho, unentshisekelo yokwakha imikhiqizo exazulula amaphuzu obuhlungu bekhasimende ngobuchwepheshe obusha.
UDkt. Ashwin Swaminathan ungumcwaningi weComputer Vision kanye Nomshini Wokufunda, unjiniyela, kanye nomphathi oneminyaka engu-12+ yesipiliyoni sembonini kanye neminyaka engu-5+ yesipiliyoni socwaningo lwezemfundo. Izisekelo eziqinile kanye nekhono elifakazelwe lokuthola ulwazi ngokushesha nokuba negalelo ezindaweni ezintsha nezisafufusa.
UDkt. Yusheng Xie Ungu-Principal Applied Scientist e-Amazon AGI. Umsebenzi wakhe ugxile ekwakheni amamodeli esisekelo se-multi-modal. Ngaphambi kokujoyina i-AGI, ubehola intuthuko ehlukahlukene ye-AI enezimo eziningi kwa-AWS njenge-Amazon Titan Image Generator kanye ne-Amazon Textract Queries.
UDkt. Hao Yang uyiPrincipal Applied Scientist e-Amazon. Izithakazelo zakhe eziyinhloko zocwaningo ukuthola izinto nokufunda ngezichasiselo ezinomkhawulo. Ngaphandle komsebenzi, u-Hao uthanda ukubukela amafilimu, ukuthwebula izithombe, nemisebenzi yangaphandle.
UDkt Davide Modolo uyi-Applied Science Manager kwa-Amazon AGI, esebenza ekwakheni amamodeli amakhulu esisekelo se-multimodal. Ngaphambi kokujoyina i-Amazon AGI, ubengumphathi/ehola iminyaka eyi-7 kuma-AWS AI Labs (i-Amazon Bedrock ne-Amazon Rekognition). Ngaphandle komsebenzi, uthanda ukuhamba nokudlala noma yiluphi uhlobo lomdlalo, ikakhulukazi ibhola likanobhutshuzwayo.
UDkt. Baichuan Sun, okwamanje ukhonza njengo-Sr. AI/ML Solutions Architect kwa-AWS, egxile ku-AI ekhiqizayo futhi usebenzisa ulwazi lwakhe kusayensi yedatha nokufunda komshini ukuze anikeze izixazululo ezisebenzayo, ezisekelwe emafini. Ngesipiliyoni sokubonisana nabaphathi kanye nesakhiwo sesisombululo se-AI, ubhekana nezinselelo eziningi eziyinkimbinkimbi, okuhlanganisa umbono wekhompyutha wamarobhothi, ukubikezela kochungechunge lwesikhathi, nokugcinwa kokubikezela, phakathi kokunye. Umsebenzi wakhe usekelwe kusizinda esiqinile sokuphathwa kwephrojekthi, i-software ye-R&D, kanye nokuphishekela imfundo. Ngaphandle komsebenzi, uDkt. Sun ujabulela ukulinganisela kokuhamba nokuchitha isikhathi nomndeni nabangane.
UDkt. Kai Zhu okwamanje usebenza njengoNjiniyela Wokusekela Kwamafu kwa-AWS, esiza amakhasimende anezinkinga kumasevisi ahlobene ne-AI/ML njenge-SageMaker, i-Bedrock, njll. Uyisazi se-SageMaker Subject Matter. Unolwazi lwesayensi yedatha nobunjiniyela bedatha, unentshisekelo yokwakha amaphrojekthi anamandla e-AI.
Kris Schultz isichithe iminyaka engaphezu kwengu-25 iletha okuhlangenwe nakho komsebenzisi okubandakanyayo ekuphileni ngokuhlanganisa ubuchwepheshe obusafufusa nomklamo osezingeni lomhlaba. Endimeni yakhe njengoMphathi Omkhulu Womkhiqizo, u-Kris usiza ukuklama nokwakha izinsiza ze-AWS ukuze anikeze amandla iMedia & Entertainment, Amageyimu, kanye ne-Spatial Computing.
isithasiselo
Ezigabeni ezilandelayo, sibonisa izimo zesampula eziyinselele ezifana nokufakwa kombhalo, izandla, nokuboniswa ukuze kugqanyiswe amakhono emodeli ye-Titan Image Generator. Siphinde sifake isampula yezithombe ezikhiqizwe ezibonelweni zangaphambili.
Umbhalo
Imodeli ye-Titan Image Generator iphumelela kakhulu ekusebenzeni okuyinkimbinkimbi njengokufaka umbhalo ofundekayo ezithombeni. Lesi sibonelo sibonisa ikhono le-Titan lokunikeza ngokucacile osonhlamvukazi nabancane ngesitayela esingaguquki ngaphakathi kwesithombe.
i-corgi egqoke ikepisi le-baseball elinombhalo othi "genai" | umfana ojabulile onika isithupha, egqoke isikithi esinombhalo othi “generative AI” |
izandla
Imodeli ye-Titan Image Generator futhi inamandla okukhiqiza izithombe ezinemininingwane ye-AI. Isithombe sibonisa izandla neminwe engokoqobo enemininingwane ebonakalayo, idlulela ngalé kwesizukulwane sesithombe se-AI esiyisisekelo esingase sintule ukucaciswa okunjalo. Kulezi zibonelo ezilandelayo, qaphela ukuvezwa okunembile kokuma kanye ne-anatomy.
isandla somuntu esibukwa phezulu | ukubuka kahle izandla zomuntu ophethe inkomishi yekhofi |
Mirror
Izithombe ezikhiqizwe imodeli ye-Titan Image Generator zihlela ngokwendawo izinto futhi zibonisa imiphumela yesibuko ngokunembile, njengoba kuboniswe ezibonelweni ezilandelayo.
Izithombe zomkhiqizo zokwenziwa
Okulandelayo yizithombe zomkhiqizo ezenziwe ekuqaleni kwalokhu okuthunyelwe kumodeli yokushumeka kwe-Titan Multimodal.
- I-SEO Powered Content & PR Distribution. Khuliswa Namuhla.
- I-PlatoData.Network Vertical Generative Ai. Zinike Amandla. Finyelela Lapha.
- I-PlatoAiStream. I-Web3 Intelligence. Ulwazi Lukhulisiwe. Finyelela Lapha.
- I-PlatoESG. Ikhabhoni, I-CleanTech, Amandla, Environment, Ilanga, Ukuphathwa Kwemfucuza. Finyelela Lapha.
- I-PlatoHealth. I-Biotech kanye ne-Clinical Trials Intelligence. Finyelela Lapha.
- Source: https://aws.amazon.com/blogs/machine-learning/use-amazon-titan-models-for-image-generation-editing-and-searching/