Chapters
00:00:00 - Introduction
00:00:18 - Overview of the Azure OpenAI service
00:01:23 - Applying ChatGPT to enterprise-grade applications on the Azure service
00:02:29 - Retrieval Augmented Generation
00:03:06 - Private Knowledge
00:03:32 - Using ChatGPT in an App
00:04:25 - Asking Questions in the App
00:05:49 - Exposing Details of Conversation Turns
00:06:31 - Injecting fragments of documents
00:06:46 - Different approaches for generating responses
00:08:14 - Adapting style of response
00:09:41 - How Information Protection Works
00:10:02 - Demonstration of Document-Level Granular Access Control
00:11:00 - Adding New Information into Search
00:11:20 - Running Scripts to Add New Information
00:12:04 - Code Behind Sample App
00:12:50 - Overview of ChatGPT
00:13:31 - Using Azure OpenAI Studio Playground
00:14:30 - Building Your Own Enterprise-grade ChatGPT-enabled App
This is pure gold. Would it not be reasonable to expect that, in the next few years, every major MSFT cloud storage and development tool like Azure SQL, SharePoint, Dataverse, Power Apps, and Power BI will offer this feature automatically?
I have been playing around with this solution and it's amazing! Nice work from Pablo and the rest of the team! One thing that I still don't get is when we should use Cognitive Search to index the content for later text-based retrieval, versus using embeddings to capture the semantics of each document, storing them in a vector store, and later searching by embedding similarity (e.g., cosine similarity).
Thanks for watching and checking out the demo. It is correct that vector stores can often be useful for this, and it is an active area of research for us. However, from what we have seen, although embeddings (used for vector search) are generally quite good at recalling candidate content, they are not necessarily as good at relevancy. The research we have seen suggests that a hybrid approach (vector search combined with traditional linguistic search such as BM25) generally provides the best results. In this demo, you might have noticed that we leverage our semantic search capability, which first uses linguistic search (BM25) to find good candidate content (L1). Then, as a second stage (L2), this content is automatically passed to an ML model (which, by the way, is from the same family of models that powers Bing.com) to help re-rank these results. Hopefully you will find as you test this demo that it performs quite well. The other advantage here is that you do not have to do your own vectorization of content, which can be both time-consuming and expensive. However, as mentioned earlier, we are continuing to research how vectorization can play a part here.
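The two-stage flow described in that reply (an L1 keyword pass to recall candidates, then an L2 re-ranking pass) can be sketched in plain Python. This is a toy illustration, not the service's implementation: the BM25 below is a minimal version, and the word-overlap re-ranker merely stands in for the semantic re-ranking model.

```python
import math

def bm25_scores(query, docs, k1=1.5, b=0.75):
    """Stage 1 (L1): classic BM25 keyword scoring over whitespace-tokenized docs."""
    tokenized = [d.lower().split() for d in docs]
    avgdl = sum(len(t) for t in tokenized) / len(tokenized)
    n = len(docs)
    scores = [0.0] * n
    for term in set(query.lower().split()):
        df = sum(1 for toks in tokenized if term in toks)
        if df == 0:
            continue
        idf = math.log(1 + (n - df + 0.5) / (df + 0.5))
        for i, toks in enumerate(tokenized):
            tf = toks.count(term)
            if tf:
                scores[i] += idf * tf * (k1 + 1) / (tf + k1 * (1 - b + b * len(toks) / avgdl))
    return scores

def rerank(query, candidates):
    """Stage 2 (L2) stand-in: re-order candidates by word overlap with the query.
    The real service uses an ML re-ranking model here."""
    q = set(query.lower().split())
    return sorted(candidates, key=lambda d: len(q & set(d.lower().split())), reverse=True)

docs = [
    "Employees may roll over unused vacation days into the next year.",
    "The cafeteria menu changes weekly.",
    "The vacation policy requires manager approval for all requests.",
]
query = "vacation rollover policy"
scores = bm25_scores(query, docs)
# L1: keep the top-2 BM25 candidates, then L2: re-rank them
candidates = [docs[i] for i in sorted(range(len(docs)), key=lambda i: scores[i], reverse=True)[:2]]
top = rerank(query, candidates)
print(top[0])
```

The cafeteria document is filtered out in stage 1, and the two vacation documents are ordered by query overlap in stage 2.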
@RichardsonNascimento I am a complete novice when it comes to these development issues. I am looking for someone to create such a chatbot for my website which uses my own organisation's data / information. Can you maybe help me with this?
Honestly, the more I hear about this tech, the more I want to use it for worldbuilding and lore for games and storytelling because wooooow that seems like a good way to prevent ever making another characterization flub or timeline mistake ever again.
Great application! However, in my experience you would not be able to rely on the current generation of models to avoid flubs or continuity errors. Have a play - see what you find - but I have found that while the response always makes grammatical sense, it doesn't always make logical sense. All it does is estimate a plausible set of words to complete the meaning of your prompt. Most of the time this makes logical sense, but there's nothing forcing it to. So while it would generate a plausible, immersive world which mostly worked, I'm sure every now and again you would still get characters coming back from the dead or teleporting from one place to another or whatever...
@MSFTMechanics I was wondering how you used that giant prompt, passing all the history to the Completion API. I thought there is a limit on the number of tokens the Completion API can digest?
How do you ensure the Cognitive Search results don't exceed the 4096-token limit for ChatGPT? And if they do exceed it (entirely possible with a large amount of corporate data), how do you chunk it for ChatGPT?
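One common pattern for the chunking this question raises (a general RAG practice, not necessarily this demo's exact approach) is to split documents into overlapping chunks at indexing time and then only pass as many retrieved chunks as fit the prompt budget. A minimal sketch, using whitespace word counts as a rough stand-in for tokens (a real app would count actual tokens with a tokenizer such as tiktoken):

```python
def chunk_text(text, max_tokens=200, overlap=20):
    """Split text into overlapping chunks. Word count stands in for tokens here."""
    words = text.split()
    chunks = []
    step = max_tokens - overlap
    for start in range(0, len(words), step):
        chunks.append(" ".join(words[start:start + max_tokens]))
        if start + max_tokens >= len(words):
            break
    return chunks

def fit_to_budget(ranked_chunks, budget_tokens=3000):
    """Keep the highest-ranked chunks until the prompt budget is spent."""
    kept, used = [], 0
    for c in ranked_chunks:
        cost = len(c.split())
        if used + cost > budget_tokens:
            break
        kept.append(c)
        used += cost
    return kept

doc = " ".join(f"word{i}" for i in range(500))
chunks = chunk_text(doc, max_tokens=200, overlap=20)
print(len(chunks), [len(c.split()) for c in chunks])
print(len(fit_to_budget(chunks, budget_tokens=450)))
```

The overlap keeps sentences that straddle a chunk boundary retrievable from both sides, and the budget cap leaves room in the context window for the question, instructions, and the model's answer.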
It looks really interesting. What about using it to gather insights about structured data? Say, for a set of headlines: what is the top-performing headline (based upon summary data), what the CTA is, and how far above or below a benchmark it performs? Basically gathering insights, through a guided process, from structured data?
Thanks for this great video, really exciting! I have one question: Are prompts (and thus company information) processed exclusively in Azure OpenAI Service, and NOT through OpenAI's API?
We are trying to build a similar solution to enable conversational q&a but using elastic search for indexing with embeddings. 1. How do you decide on the chunk size before indexing? 2. How different would the retrieved chunks based on cosine similarity be when compared with cognitive search?
OMG been looking for this information for 3 weeks. Thank you. Saw it before on another channel but it was very confusing compared to how this video explained things.
Could you share the script to update the data and explain how it works? Do you have an automatic way to update the application, e.g., by running azd deploy, or something else?
Amazing one! I love it. There are so many nuances in this; it's not just simple retrieve-and-generate. A sudden curiosity: if the LLM has so much reasoning and language understanding, why can't we ask it directly to rank the documents and filter out the unnecessary ones via an in-context learning prompt? Why do we need a separate re-ranker component?
Sorry, I didn't get it. Do I understand correctly that if we want to keep our data private, we need to keep it separate from the model, and only add pieces of information during the response-generation process? If we start to teach the model our data, will it become public?
Great questions. Short answer is no. The search is retrieving additional information to add to the prompt. Information from prompts is not stored in the large language model. Also, there are multiple instances of the model running, and the ones used for the Azure OpenAI Service are not public instances.
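The pattern that reply describes (retrieved text is injected into the prompt for a single inference call, never written into the model) can be sketched as follows. The prompt wording, source names, and citation convention below are illustrative assumptions, not the demo's exact template:

```python
def build_prompt(question, sources,
                 system_hint="Answer ONLY from the sources below. Cite the source name."):
    """Assemble a grounded prompt: retrieved snippets are injected as context
    for this one request; nothing is written back into the model's weights."""
    context = "\n".join(f"[{name}] {text}" for name, text in sources)
    return f"{system_hint}\n\nSources:\n{context}\n\nQuestion: {question}\nAnswer:"

# Hypothetical documents returned by the search step for this user
sources = [
    ("benefits.pdf", "Employees receive 20 vacation days per year."),
    ("handbook.pdf", "Unused days expire on December 31."),
]
prompt = build_prompt("How many vacation days do I get?", sources)
print(prompt)
```

Because the private content only ever lives inside the prompt of an individual request, keeping data out of the model reduces the question of privacy to who can retrieve which documents, which is where the search layer's access controls come in.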
Please help!!! What advice do you have for where to store the data used to tune the GPT? We have a complex data set in Azure Storage tables, and are wondering what the best database is: Azure SQL? Access? Azure Blob? Something else?
Great presentation! Now that this information is public knowledge, I need to come up with something more creative when communicating with clients who are interested in LLM's : )
Very interesting! Would it also be possible to make an integration between GPT and SAP or MS Dynamics? I am an SAP FI consultant handling incidents and changes submitted by the finance departments. Would it be possible to make a private model in which GPT can read through the SAP system and give instructions on how to solve certain incidents? For example, if a user gets a certain error when performing a payment run, would GPT be able to analyse where in the system this error is coming from and how to solve it? Not just giving recommendations as it does now, when I anonymize the data and submit it in the public GPT environment. And of course all without sharing any information to the outside world.
Are source citations accurate? Because in Bing they are often wrong: when you click on a citation, you realize the referenced website is not the actual source of the information provided.
Impressive; the one I was looking for for a long while. Can anyone suggest which Azure OpenAI language model I can use to compare two PDF documents to check whether the information is available in both documents or not?
Hi, do you have details on the type of RBAC role required to be able to deploy the demo? I am getting: 'the client does not have the necessary permissions to perform the specified action'. I have Cognitive Services Contributor access.
It looks great!! Thanks. Is there a video explaining the coding step by step from start to end? It would be great for those of us who are starting out with AI and Azure.
We cover that in the video. Your data is not used for training the large language model, it's only part of the prompt for inference as demonstrated in the example.
@MSFTMechanics Noted. Thanks for responding! So if that's the case, can an organization be HIPAA-compliant (with respect to not exposing PII and PHI)? I want to make sure that the in-context learning (or 'RAG') paradigm doesn't expose our customers' data to OpenAI / Azure OpenAI or anyone else. That's probably the biggest blocker to implementing any production-grade app for our team. Thanks in advance for a thorough answer.
The question I would have is this one: is the "private data" actually protected? In a chat, ChatGPT said that I should not share private information with it, because it cannot guarantee that the data "will not be used / made public or something".
Where someone has a long session, how does the Azure OpenAI Service deal with token limits, given it has to pass the whole context, especially where previous responses are long?
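A common way applications handle this (an assumption about typical client-side practice, not something the service does automatically; a request over the limit is simply rejected) is to drop the oldest turns until the conversation fits the budget. A sketch, again using word counts as a rough token proxy in place of a real tokenizer:

```python
def trim_history(turns, system_msg, budget=3000):
    """Drop the oldest conversation turns until everything fits the token budget.
    Word count stands in for tokens; use a real tokenizer in practice."""
    def cost(msgs):
        return sum(len(m["content"].split()) for m in msgs)
    kept = list(turns)
    while kept and cost([system_msg] + kept) > budget:
        kept.pop(0)  # the oldest turn is first in the list
    return [system_msg] + kept

system_msg = {"role": "system", "content": "You are a helpful assistant."}
turns = [{"role": "user", "content": " ".join(["hello"] * 100)} for _ in range(50)]
trimmed = trim_history(turns, system_msg, budget=1000)
print(len(trimmed))
```

The system message is always preserved; only the oldest turns are sacrificed. More sophisticated apps summarize the dropped turns instead of discarding them outright.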
Can anyone help me with robust strategy for handling dependent and independent questions during the conversation, including generating a standalone question to provide additional context for dependent questions? Is the strategy used here to augment the user's latest question with prior conversation history robust for all kinds of scenarios?
You would implement the same access controls and permissions as you would now when implementing Azure Cognitive Search. We demonstrate that in the video. The information used to augment the prompt is retrieved based on the individual's permissions.
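Conceptually, the document-level trimming described in that reply works by storing an ACL on each indexed document and filtering search results by the caller's group memberships; in Azure Cognitive Search this is typically expressed as a filter applied at query time. The field names below are assumptions for illustration, not the demo's schema:

```python
def security_trim(results, user_groups):
    """Keep only documents whose ACL (the hypothetical 'groups' field)
    intersects the caller's group memberships."""
    allowed = set(user_groups)
    return [doc for doc in results if allowed & set(doc["groups"])]

# Hypothetical search results, each carrying its ACL
index = [
    {"id": "doc1", "content": "Org-wide benefits overview", "groups": ["all-employees"]},
    {"id": "doc2", "content": "Executive compensation plan", "groups": ["execs"]},
]
visible = security_trim(index, user_groups=["all-employees", "engineering"])
print([d["id"] for d in visible])
```

Because the trimming happens in the retrieval layer, a restricted document never reaches the prompt at all, so the model cannot leak content the user was not entitled to see.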
Hi! I've been trying to recreate this project on my machine and I'm getting an error I don't quite get. I've found a workaround, but I feel like my workaround is reducing the performance of the assistant. I'm using an Azure OpenAI service based on gpt-35-turbo, and when I try to ask a question using RRR or RDA I'm getting an exception saying that gpt-35-turbo does not support the parameters "logprobs, best_of and echo". I've deactivated them in order to make the project work, but as I've said, it feels like the quality of the responses has diminished. Did anybody else encounter this problem?
We show the manual, on-demand process for updating the search index at 11:27, but normally these types of updates would run on a schedule or based on eventing logic.
You can sign up for it as an individual developer, but you do need an Azure subscription. For a "free to use" option, you can also try Bing Chat if you're looking for alternatives to OpenAI chat.