Тёмный

How to Detect & Bypass Detection of ChatGPT and GPT3 Generated Text 

Cristi Vlad
Подписаться 36 тыс.
Просмотров 33 тыс.
50% 1

Опубликовано:

 

23 окт 2024

Поделиться:

Ссылка:

Скачать:

Готовим ссылку...

Добавить в:

Мой плейлист
Посмотреть позже
Комментарии : 69   
@FerdieJamesNervida
@FerdieJamesNervida Год назад
I recently tested and made a system on how to bypass AI Detection tools. It involves paraphrasing and asking AI to deliberately make misspellings. It worked 90% of the time but is time consuming. Now I just told ChatGPT to "write an article about "XXY" so that 100% of the time it does not contain any machine-generated content." That pretty much solved this problem and bypassed any AI Content Detection tool available so far. Of course, this may change as technology grows. It's just a matter of who is diligent enough to go deeper into the rabbit hole. Cheers!
@Prosperidadedigital
@Prosperidadedigital Год назад
can you explain more about this ?
@CristiVladZ
@CristiVladZ Год назад
yes! let's hear it! :)
@FerdieJamesNervida
@FerdieJamesNervida Год назад
@@CristiVladZ Unfortunately, its not a panacea. The writer will need to test the prompts if it gives the preferred results. The best method now is back to the manual way. Run the content to Quillbot paraphraser then deliberately make some typographical error and ask ChatGPT again to rewrite word for word but insert 1 grammatical error. This will make the AI detection tool see it as Human generated content. I tried GPTZero and it is more robust. No amount of rephrasing and typo can trick it. The challenge now is how to bypass that.
@schmetterling4477
@schmetterling4477 Год назад
And everything you have generated that way still sounds like the below average internet troll. ;-)
@justclick1n
@justclick1n Год назад
@@FerdieJamesNervida Tried this and it seem to be work but need to play around with more articles to test the same...
@danp6118
@danp6118 Год назад
In summary, the text describes the use of a language model called Roberta-based OpenAI detector to determine if text has been generated by a chatbot called GPT (Generative Pre-training Transformer). The detector was trained by fine-tuning a Roberta-based model with the outputs of GPT2. The text also mentions Hugging Face, a platform for AI-related resources, and its hosted API interface for the detector model. The text then goes on to test the effectiveness of the detector by feeding it text generated by a version of GPT called Chat GPT and text taken from a Google search on implementing encryption in Android apps. The text concludes by stating that the detector is able to accurately identify fake text generated by GPT, but may not work as well for text taken from other sources.
@CristiVladZ
@CristiVladZ Год назад
The summarizer didn't do the best job here
@jean-baptistedelabroise5391
hmm, I think if I implement en detector I will preprocess the input to remove the characters that are not encountered in the vocabulary of the model. but yes, there are many adversarial examples to trick the detector I think.
@CristiVladZ
@CristiVladZ Год назад
I think you might also be able to use the model itself to write in unpredictable ways. I'm testing with this as we speak
@nuduw
@nuduw Год назад
@@CristiVladZ Have you made a progress in this regard?
@phily8020
@phily8020 Год назад
Fantastic video, please do more experiments like this. I like your methodical approach
@CristiVladZ
@CristiVladZ Год назад
Thanks
@Nocare89
@Nocare89 Год назад
If the language model can analyze the probability of a word coming next it certainly could be made to identify low-probability breaks in the chain. But its a seesaw of fix and brake.
@bartech101
@bartech101 Год назад
Was thinking about the same. Why only 2 odd / characters in 131 have so much weight in deciding is text AI generated or not.
@CristiVladZ
@CristiVladZ Год назад
This gives me idea for prompt engineering to bypass the detection automatically
@Nocare89
@Nocare89 Год назад
@@CristiVladZ Haha I like it
@beyonddev4008
@beyonddev4008 Год назад
I played around with the detector and it’s more focused on grammer, I made 3 grammer errors by removing a period and 2 commas and it said it was real, it is very interesting but I wouldn’t trust it 100%, I think it’s a 50/50 depending on who wrote it, how they wrote it, ect.
@pillowbug999
@pillowbug999 Год назад
Yes , i also experimented and it changes drastically then u use commas in specal grammer places
@CristiVladZ
@CristiVladZ Год назад
that's a hack.
@timtim2500
@timtim2500 Год назад
Has the code been updated since your video? I mean, out of curiosity, I copied some AI text into the detector... and it detected its 99.5% fake... then I tried to add the / but its no longer causing the results to change
@CristiVladZ
@CristiVladZ Год назад
chatGPT has been updated, some popular detectors don't detect it anymore...
@ace_security
@ace_security Год назад
​@Cristi Vlad Hello sir, can i be intern under your supervision. I really want to learn more about cybersecurity and pentesting. Hoping for the reply.
@ANONYMOUS-cp2se
@ANONYMOUS-cp2se Год назад
Does RU-vid detect GPT 3 generated work???
@CristiVladZ
@CristiVladZ Год назад
it might, if you download the transcript of the video and plug it in a detector
@DanielZivetzWLAcompexpert
@DanielZivetzWLAcompexpert Год назад
I had chatGPT write a RU-vid script on a topic. The output was good but of course, failed the Hugging face test. I've been using Quillbot to paraphrase and that has been working pretty well. So, in the same chattGPT chat for the YT script, I wrote" write it so it won't be AI detected" and sure enough what it produced was 98.37% real when put through Hugging face.
@ehsanghazavi470
@ehsanghazavi470 Год назад
This actually works?
@WestLAComputerExpertLosAngeles
@@ehsanghazavi470 sorry. No it doesn't work. I tried a couple more times and it says no. I don't know why it worked that one time
@CristiVladZ
@CristiVladZ Год назад
thanks for sharing!
@iancroft5704
@iancroft5704 Год назад
Me: write me a script for RU-vid on how to pass originality checks on chat gpt generated work GPT: Hello and welcome to this tutorial on how to pass originality checks when using OpenAI's GPT-3 language model. So you've used GPT-3 to generate an essay or article, but now you're worried it may not pass originality checks. No need to worry, as there are a few simple steps you can follow to increase your chances of passing these checks. Step 1: Know the software being used for originality checks. Different software will have different methods for detecting machine-generated content, so it's important to know which software will be used for the originality check. This will help you determine the best strategies to use to avoid detection. Step 2: Make edits to the text generated by GPT-3. The most effective way to avoid detection is to make significant changes to the text generated by GPT-3. This includes adding, removing, or rephrasing entire sentences and paragraphs, as well as making small changes like correcting grammar and spelling errors. The more changes you make, the lower the chances of detection. Step 3: Use plagiarism checkers. Another useful tool to help you pass originality checks is a plagiarism checker. These checkers will analyze your text and compare it to other sources on the internet, identifying any instances of plagiarism. This will give you an idea of how original your text is, and help you make any necessary changes. Step 4: Write in your own voice. When making changes to the text generated by GPT-3, try to write in your own voice. This means using your own words and phrasing, rather than simply copying and pasting from the GPT-3 generated text. Step 5: Avoid using overly technical language. Using overly technical language is another common characteristic of machine-generated text, so it's important to avoid this. Instead, try to write in a more conversational tone, using language that is easy to understand. In conclusion, by following these five simple steps, you can greatly increase your chances of passing originality checks when using GPT-3 generated text. Remember, the key is to make significant changes to the text, write in your own voice, and avoid using overly technical language. Good luck!
@CarlosMoraesCom
@CarlosMoraesCom Год назад
I tried checking with the other checker you used, and it's quite hard to move the needle. you can add 100 "/''s" and it won't help
@CristiVladZ
@CristiVladZ Год назад
currently, chatGPT passes many detectors with flying colors.
@Oxygenationatom
@Oxygenationatom Год назад
Come on man this was fun
@CristiVladZ
@CristiVladZ Год назад
Thanks
@schmetterling4477
@schmetterling4477 Год назад
It's trivial: All these "AI" tools sound like internet trolls. That's what they were trained on, after all. ;-)
@nefty1004
@nefty1004 Год назад
Great video 👍
@CristiVladZ
@CristiVladZ Год назад
Thanks
@johndank2209
@johndank2209 Год назад
@@CristiVladZ aye yo! take the vid down, my teachers are gonna see this!
@liviuteodorescu2891
@liviuteodorescu2891 Год назад
Bravo.
@CristiVladZ
@CristiVladZ Год назад
;)
@ANGEL_BOB_YT
@ANGEL_BOB_YT Год назад
Just put in an article spinner
@avi7278
@avi7278 Год назад
Interesting, which made me wonder, can you just ask chat gpt to do it for you. Turns out, of course. Original prompt: Write a one paragraph explanation of how SSD works Result: 99% label 0 (fake, written by AI) Solid State Drives (SSDs) are a type of storage device that uses flash memory to store data. Unlike traditional hard disk drives (HDDs) which use spinning disks to access data, SSDs have no moving parts, making them faster and more reliable. They work by using a series of interconnected flash memory chips to store data, which can be quickly accessed and written to using an integrated circuit called a controller. The controller manages the flow of data to and from the memory chips, allowing the SSD to access and write data at much faster speeds than an HDD. Followup prompt: [exact prompt redacted...] Result: 99% label 1 (real, written by a human) Solid State Drives (SSDs) are a type of storage apparatus that employs flash memory to keep data. Unlike conventional hard disk drives (HDDs) which utilize spinning disks to access data, SSDs do not have moving parts, making them quicker and more dependable. They function by utilizing a series of interconnected flash memory chips to keep data, which can be rapidly accessed and written to using an integrated circuit referred to as a controller. The controller oversees the flow of data to and from the memory chips, enabling the SSD to access and write data at much faster speeds than an HDD. As you can see, GPT is quite capable of defeating these checkers in a way that maintains a nearly entirely natural writing style (it doesn't appear clearly manipulated). As long as there are models capable of generating this text, the same models will be capable of defeating any system or watermark attempting to detect it as written by said model.
@CristiVladZ
@CristiVladZ Год назад
I never thought of that...what was the prompt for bypass?
@ag27
@ag27 Год назад
for real bro what prompt did you use?
@aesonswampmonster2958
@aesonswampmonster2958 Год назад
@@ag27 yeah lol lmk
@zenith6695
@zenith6695 Год назад
what prompt you used to bypass? curious
@shutthedoor2052
@shutthedoor2052 Год назад
i also was able to do this and i concur that it works using the right prompts. prompts aren't really special. just think of something that you insert while writing that is typical to what humans do
@danielprokoshin
@danielprokoshin Год назад
good video, but could've been easily cut down to 2-3 minutes
@CristiVladZ
@CristiVladZ Год назад
Why? People can't digest long form anymore? I just recorded myself without editing....
@danielprokoshin
@danielprokoshin Год назад
@@CristiVladZ Yeah, people can't digest long form. It's better for the performance of your video. If you edited it slightly you would get a lot more views.
@CristiVladZ
@CristiVladZ Год назад
​@@danielprokoshinthose people are not what I'm looking for in my audience. So it's better if they skip. By making long form, the content will stick to those who resonate with it...If you don't like it, you shouldn't be here. I don't need TikTok-like audience...
@ClayFRocker
@ClayFRocker Год назад
@@CristiVladZ I get your sentiment but to generalize a 'TikTok-like audience' when there are over a billion monthly active users is a bit silly. Short, concise and well edited content is good content. If you don't want to make good content, that's your prerogative. But just say that you don't want to put in the effort instead of making generalizations.
@danielprokoshin
@danielprokoshin Год назад
@@CristiVladZ I get that, I am very against TikTok and informative TikTok-style content as well. However, I am against informative TikTok-style content because it usually brushes over important things and tries to summarize way too much information into a 1-minute video. In my opinion, you could summarize the useful information here into a 3-4 minute video that wouldn't resemble TikTok content, it would just be more concise and get to the point faster.
@dombarton2483
@dombarton2483 Год назад
You dont need to worry about any detection...just stop using these tools and write it yourself. Problem solved. Google will always find out. Humans just stop being lazy!
@ihateteo
@ihateteo Год назад
It’s not even really being lazy. Life is too short to spend on stupid stuff like writing a ten page essay that you will not even remember about in 2 years.
@dombarton2483
@dombarton2483 Год назад
@@ihateteo we never had gpt3 a few years back and we survived very nicely indeed. Gpt3 and beyond including everything associated with it will mainly be used to try and trick google..that is my point. People will always take the path of least resistance. No matter what you do..google will get you. So just go back to pen and paper and use your brain.
@ihateteo
@ihateteo Год назад
@@dombarton2483 we survived, but all that time we spent “surviving” as you say, could’ve been used to do things that were actually fun
@dombarton2483
@dombarton2483 Год назад
@@ihateteo gpt3 is all about making things easier but most will not be using it for fun but instead trying to outsmart google...they wont win!
@beyonddev4008
@beyonddev4008 Год назад
@@dombarton2483 “outsmart google” 💀💀💀
Далее
the truth about ChatGPT generated code
10:35
Просмотров 229 тыс.
How to HACK ChatGPT
4:53
Просмотров 336 тыс.
I tricked MrBeast into giving me his channel
00:58
Просмотров 10 млн
Do ChatGPT Detectors Work? Let's Find Out!
11:41
Просмотров 14 тыс.
HACKING A BANK | GoBuster
6:28
Просмотров 772
How AIs, like ChatGPT, Learn
8:55
Просмотров 10 млн
10 ChatGPT Life Hacks - THAT’LL CHANGE YOUR LIFE !!
11:00
I Tried Every AI Coding Assistant
24:50
Просмотров 832 тыс.
Mind-Reading AI Technology Is Here
8:58
Просмотров 73 тыс.