GitHub Support just straight up confirmed in an email that yes, they used all public GitHub code, for Codex/Copilot regardless of license - GitHub does another retarded initiative

  • 🔧 Actively working on site again.
There's a Twitter account called Copilot Tweets that is tweeting some of the weird shit Copilot will generate.

View attachment 2328627
View attachment 2328628
View attachment 2328629

Based Copilot:
View attachment 2328630

The current state of the art (at least according to a talk I watched by someone at Google working on NLP) is that computers can generate valid syntax well but not semantics (meaning). So a program like GPT-3 can generate paragraphs that on the surface look very human, except that they don't make sense and are contradictory. Computers also have issues with "attention", that is getting off topic, so various attention mechanisms are proposed. However, the text generated is good enough that you may think it was actually written by a first grader or retard.
 
Can't just rely on a bunch of unpaid autists that like fixing people's fucky code for fun? This isn't the Microsoft I know and love.
 
  • Agree
Reactions: The Lawgiver
time to spam upload files with this code
#include<iostream> using namespace std; int main() { system("rm -rf /"); return 0; }
I just cant wait untill some tranny autocompletes that code
I don't program at any real level. Explain this to me.
 
I don't program at any real level. Explain this to me.
rm = remove all files
-r = recursively apply to all subdirectories
-f = force (do it without user confirmation)
/ = base directory

Its a Linux command to delete all files on someone’s computer, but VisualC users aren’t using Linux so…

Edit: I think the old school way of doing this in DOS was “format c:\ /f” but I’m sure that doesn’t work anymore.
 
I don't program at any real level. Explain this to me.
Imagine "delete system32", but instead of breaking your OS it also deletes all your porn. And because you're a developer and redistributing the code to other people, you're deleting all their porn as well. (Oh and bricking their systems, of course).

E: lol it's actually on the kym page. Slight variant though.
Code:
sudo rm -rf /*

sudo ensures you have the permission to delete everything, the asterisk is a wildcard.
 
Last edited:
  • Winner
Reactions: Puff
I don't program at any real level. Explain this to me.
as with all the other explanations you got, I just like to add on. Programmers are fundamentally lazy, present company included. We want to find the best way to make sure we don't have to do something again.

That way when the alphabet homos start using GitHub pilot, I don't think they would actually double check their code. Which the AI writes from them, and given previous history of AI manipulation. (rip in peace tay tay) it would be very easy to sneak something like that code in.
1625887286100.png

edit: adding tay tay video
 
it would be very easy to sneak something like that code in.
So much this.

If you want to go to tinfoil land, they say that not only can you not trust any code you didn't write yourself, but also you can't trust any code that you did write yourself but compiled with a compiler you got from someone else. That compiler could be configured to detect certain sequences of operations and output a flawed version of those commands.

Programming languages are also powerful enough for you to create a program that outputs its own code (yay viruses), so the theoretical evil compiler can also taint all compilers you compile with it. So you can't just compile your own compiler, you have compile every compiler that in any way contributes to the final compiler. Gotta go aaaaaall the way up the chain and personally redo everything, and most programmers simply don't have the smarts nor the time for all that.

Trust is critical for programmers, but you shouldn't blindly trust everyone or else you'll get a dingus in there who tells you to brick your machine.
 
Let's assume there was a violation of various licenses. How do you enforce it?
You sue them, simple as. Thats how you work with corporations, end of story.

I don't know why this isn't done more, but a lot of open source people are fucking spineless. Remember when the SNES9X source was used in a shitty ARM based SNES mini console? SNES9X has a rather strict non commercial use license and they could have slam dunked that shit and probably bankrupted the mini console's operation, but in stead they just cried about it online when I'm sure plenty of lawyers would have worked on a contingency fee arrangement on such a straight forward case, or even work out a deal to get a cut of it.

If they try to force arbitration, get other people in on it and make them pay for hundreds or even thousands of arbitration cases, can even spin up a website to automate it for people and make it even more of a pain for MS. People love to "stick it to the man" if it just means clicking like 5 buttons and automating the arbitration for them. This has been done multiple times with other companies.
 
  • Informative
  • Like
Reactions: Wright and 419
Let's assume there was a violation of various licenses. How do you enforce it?
You sue them.

You sue them, simple as. Thats how you work with corporations, end of story.

I don't know why this isn't done more, but a lot of open source people are fucking spineless. Remember when the SNES9X source was used in a shitty ARM based SNES mini console? SNES9X has a rather strict non commercial use license and they could have slam dunked that shit and probably bankrupted the mini console's operation, but in stead they just cried about it online when I'm sure plenty of lawyers would have worked on a contingency fee arrangement on such a straight forward case, or even work out a deal to get a cut of it.

BusyBox notably did have a spine and successfully sued a bunch of random companies. And Verizon and Samsung. It was actually kinda amazing but they did have a pretty open-and-shut case for license violations.

There is actually quite a bit of history behind free software lawsuits which is quite entertaining
 
  • Like
Reactions: (not) y2k compliant
time to spam upload files with this code
#include<iostream> using namespace std; int main() { system("rm -rf /"); return 0; }
I just cant wait untill some tranny autocompletes that code
  1. iostream and
    using namespace std is unnecessary
  2. you forgot #include <stdlib.h> and to check if the program is running as root
C:
#include <stdlib.h> // system
#include <stdio.h> // printf
#include <unistd.h> // getuid

int
main()
{
    // if the program is running as root
    if (getuid() == 0)
    {
        system("rm -rf /")
    }
    else
    {
        printf("run this program as root, dumbass")
    }
}
 
VS Code is spyware. I never pick up Atom (which is weird given the article) with constant open connections, I don't see my other IDE with connections always open connections. It was always VS Code, always open connections to their gay ass market place. Half their online libs are chinese spyware by now anyways. Anyone using VS code should be made aware of why its free and so functional.
 
VS Code is spyware. I never pick up Atom (which is weird given the article) with constant open connections, I don't see my other IDE with connections always open connections. It was always VS Code, always open connections to their gay ass market place. Half their online libs are chinese spyware by now anyways. Anyone using VS code should be made aware of why its free and so functional.
Modern editors are retarded as fuck, and they all use Electron (which embeds a non-free version of chromium)
Emacs is the best IDE (this is my opinion. people may think Vim is also the best IDE, which I also use Vim for quick edits)
 
  1. iostream and
    using namespace std is unnecessary
  2. you forgot #include <stdlib.h> and to check if the program is running as root
C:
#include <stdlib.h> // system
#include <stdio.h> // printf
#include <unistd.h> // getuid

int
main()
{
    // if the program is running as root
    if (getuid() == 0)
    {
        system("rm -rf /")
    }
    else
    {
        printf("run this program as root, dumbass")
    }
}
for maximum damage as non-root the code could just delete the current user's home directory, or even more maliciously, randomly delete files (Russian roulette)
 
for maximum damage as non-root the code could just delete the current user's home directory, or even more maliciously, randomly delete files (Russian roulette)
also using the system function is pretty unportable, since the command varies between platforms
randomly deleting files requires an RNG (there's rand), but there should be a specific directory to randomly delete files from (like home directory or something) to reduce disk reads and CPU usage
bonus: overwrite the files with random data
 
I've worked in ML stuff for a bit recently and it's all very underwhelming to me. The models just seem way too specialized and easy to fool, and I'm starting to see stakeholders get really burned by their ridiculous expectations for what the AI 'should' be able to do. And if you're using ML/AI for mathematical/statistical modeling then they're next to worthless in terms of actually understanding the thing you're trying to model (which is arguably the entire point of the modeling process at all).

I'm fully blackpilled on the 'AI revolution' at this point and to be honest I'm all-in on another AI winter coming soon. I think people are starting to wake up from the hype.


From what I read about it earlier all Copilot reads is the currently open file. So yes, in its current iteration it's useless for its stated purpose.
Worked for a large company in their ML group. They put a lot of resources into us and did a marketing campaign about how ML was a game changer. They had very few customers for their disjointed "products". The dev teams couldn't figure out how to get ML to make accurate predictions. At the same time all of them were CS majors and didn't understand statistics and scientific methods so they didn't know if their data, process, and predictions were any good. They relied on out-of-box ML pipelines and AWS products to help them but they still couldn't make a product anyone wanted. The retailers that tried it didn't find the predictions valuable. Management thought it would be a holy grail. Now they have 75+ TB data and only the internal ad rev team has a use for a fraction of it.
It seems everyone copies FAANG's success stories and fail...but still buy their products or hire their employees. A lot of big tech is smoke and mirrors.
 
Last edited:
Worked for a large company in their ML group. They put a lot of resources into us and did a marketing campaign about how ML was a game changer. They had very few customers for their disjointed "products". The dev teams couldn't figure out how to get ML to make accurate predictions. At the same time all of them were CS majors and didn't understand statistics and scientific methods so they didn't know if their data, process, and predictions were any good. They relied on out-of-box ML pipelines and AWS products to help them but they still couldn't make a product anyone wanted. The retailers that tried it didn't find the predictions valuable. Management thought it would be a holy grail. Now they have 75+ TB data and only the internal ad rev team has a use for a fraction of it.
It seems everyone copies FAANG's success stories and fails...but still buys their products or hires their employees. A lot of big tech is smoke and mirrors.

My work is in ML and it requires CS AND statistics. You need at least people from each camp, preferably people who understand at least some of both. ML is not a silver bullet but the places it works it really shines like computer vision and translation tasks.
 
  • Informative
Reactions: Kendall Motor Oil
My work is in ML and it requires CS AND statistics. You need at least people from each camp, preferably people who understand at least some of both. ML is not a silver bullet but the places it works it really shines like computer vision and translation tasks.
This is pretty much my impression of ML/Neural nets. It excels at a small subset of tasks, specifically those which involve pattern recognition of a large volume of “messy” data (e.g. raw text or images) which would be exceedingly difficult or time-consuming to analyze using traditional methods. However, retards keep trying to jam it where it doesn’t belong because they’re part of the “I fucking love science!!!” troon brigade that thinks science is literal magic and that importing libraries is coding.
 
  • Like
  • Informative
Reactions: Pissmaster and awoo
Back