Neural Networks

From jWiki
Revision as of 13:14, 21 May 2024 by Snoopj (talk | contribs) (→‎Non-academic works: Add reference to original SolidGoldMagikarp posting)
(diff) ← Older revision | Latest revision (diff) | Newer revision → (diff)
Jump to navigationJump to search

Herein lie some of my thoughts and resources about neural networks. Because I work for a company that builds models for computer vision, I have a bit of a professional bias towards image models, but I have tried to represent my knowledge/opinions about a broader range of subjects here.

What do you think about generative "AI"?

tl;dr - mostly dancing bearware, some novel uses in responsibility laundering

Resources

Image models

Text models

For code

For everything else

  • Washington Post coverage of the data contained in the 'C4' dataset and how it influences the training of popular large models. Also allows users to check if arbitrary URLs are part of the dataset. (NOTE: C4 is not the only source of training text for the models being discussed, and the authors aren't doing a great job highlighting that, but it should still be pretty representative)
  • How well does ChatGPT speak Japanese? - an April 2023 evaluation of GPT-3.5 and GPT-4 performance on Japanese language assessments. Also includes an interesting comparison of the number of tokens required to represent the "Lord's Prayer" in multiple languages. I found the results of the latter particularly surprising.

Misc.

  • I gave a talk on the fundamentals of neural networks to Boston Python in March 2023
  • 3blue1brown has an excellent series of lessons about the fundamentals of neural networks. Particularly interesting to me is the lesson on backpropagation for its excellent visualization of the process of adjusting neural network weights.

Dumping ground

These references are totally unclassified

Writings by others

Academic works

Non-academic works

Lawsuits

The legal status of generative models and their implications for intellectual property in the US is something I'm trying to keep an eye on. The cases given below are of particular interest to me.

The New York Times Company v. MICROSOFT CORPORATION

Entry #252 in The New York Times Company v. Microsoft Corporation, 1:23-cv-11195
STIPULATED SOURCE CODE ORDER. IT IS HEREBY ORDERED that any person subject to this Source Code Orderincludingwithout limitation the parties to this action, their representatives, agents, experts an...
Entry #251 in The New York Times Company v. Microsoft Corporation, 1:23-cv-11195
STIPULATION AND ORDER RE: DISCOVERY OF ELECTRONICALLY STORED INFORMATION. To expedite the flow of discovery material and to facilitate the consistency in the format of the documents to be produced...
Entry #250 in The New York Times Company v. Microsoft Corporation, 1:23-cv-11195
LETTER addressed to Magistrate Judge Ona T. Wang from Joseph C. Gratz dated September 27, 2024 re: Status Update. Document filed by OAI Corporation, LLC, OpenAI GP, LLC, OpenAI Global, L.L.C., Open...

Andersen v. Stability AI Ltd.

Entry #232 in Andersen v. Stability AI Ltd., 3:23-cv-00201
ORDER DENYING 225 MOTION FOR CLARIFICATION OR RECONSIDERATION by Judge William H. Orrick. 228 Motion to Strike denied as moot. (jmd, COURT STAFF) (Filed on 9/30/2024) (Entered: 09/30/2024)
Entry #231 in Andersen v. Stability AI Ltd., 3:23-cv-00201
ORDER RE: CASE MANAGEMENT CONFERENCE by Judge William H. Orrick granting 230 Stipulation. Case Management Conference set for 10/29/2024 02:00 PM via Videoconference (Case Management Statement due b...
Entry #230 in Andersen v. Stability AI Ltd., 3:23-cv-00201
STIPULATION WITH PROPOSED ORDER Re: Case Management Conference filed by Sarah Andersen, Gerald Brom, Adam Ellis, Julia Kaye, Gregory Manchess, Kelly McKernan, Karla Ortiz, Grzegorz Rutkowski, H Sou...


Getty Images (US), Inc. v. Stability AI, Inc.

Entry #65 in Getty Images (US), Inc. v. Stability AI, Inc., 1:23-cv-00135
NOTICE requesting Clerk to remove Allyson R. Bennett as co-counsel. Reason for request: no longer with the firm. (Flynn, Michael) (Entered: 09/20/2024)
Entry #64 in Getty Images (US), Inc. v. Stability AI, Inc., 1:23-cv-00135
REQUEST for Oral Argument by Getty Images (US), Inc. re 45 MOTION to Dismiss for Failure to Join a Party MOTION to Dismiss for Lack of Jurisdiction Over the Person, 48 MOTION to Transfer Case to No...
Entry #63 in Getty Images (US), Inc. v. Stability AI, Inc., 1:23-cv-00135
REDACTED VERSION of 56 Declaration by Getty Images (US), Inc.. (Attachments: # 1 Exhibit A - V)(Vrana, Robert) (Entered: 08/21/2024)

Doe 1 v. GitHub, Inc.

Entry #282 in DOE 1 v. GitHub, Inc., 4:22-cv-06823
ORDER GRANTING MOTION TO CERTIFY ORDER FOR INTERLOCUTORY APPEAL AND MOTION TO STAY PENDING APPEAL by Judge Jon S. Tigar granting 268 Motion for Leave to Appeal. (dms, COURT STAFF) (Filed on 9/27/20...
Minute entry from 2024-09-17 in DOE 1 v. GitHub, Inc., 4:22-cv-06823
Notice of Appearance/Substitution/Change/Withdrawal of Attorney
Entry #281 in DOE 1 v. GitHub, Inc., 4:22-cv-06823
NOTICE of Withdrawal filed by Allyson Roz Bennett, no longer appearing on behalf of OAI CORPORATION,, OPENAI GLOBAL, LLC, OPENAI HOLDCO, LLC, OPENAI HOLDINGS, LLC, OPENAI STARTUP FUND SPV GP I, L.L...

Silverman v. OpenAI, Inc.

Entry #71 in Silverman v. OpenAI, Inc., 3:23-cv-03416
NOTICE of Change In Counsel by Joseph R. Saveri Withdrawal of Counsel - Kathleen J. McMahon (Saveri, Joseph) (Filed on 8/12/2024) (Entered: 08/12/2024)
Entry #70 in Silverman v. OpenAI, Inc., 3:23-cv-03416
PRETRIAL ORDER as Modified. Signed by Judge Araceli Martinez-Olguin on 2/16/2024. (ads, COURT STAFF) (Filed on 2/16/2024) (Entered: 02/16/2024)
Entry #69 in Silverman v. OpenAI, Inc., 3:23-cv-03416
Order as Modified by Judge Araceli Martinez-Olguin granting (60) Stipulation Consolidating Cases in case 3:23-cv-03223-AMO. Associated Cases: 3:23-cv-03223-AMO, 3:23-cv-03416-AMO, 3:23-cv-04625-AMO...

Kadrey v. Meta Platforms, Inc.

  • Similar suit to Silverman v. OpenAI, same parties etc.
  • Notable for a prominent dismissal of the class-action nature of the case, as the blatantly copied copyrighted works in the training data are not the works of the plaintiffs.
  • Latest [1]:
Entry #199 in Kadrey v. Meta Platforms, Inc., 3:23-cv-03417
Opposition/Response to Motion
Entry #198 in Kadrey v. Meta Platforms, Inc., 3:23-cv-03417
Judicial Referral for Purpose of Determining Relationship of Cases re 24-cv-6893 AMO and 23-cv-3417 VC. Signed by Judge Araceli Martinez-Olguin on 10/3/2024. (ads, COURT STAFF) (Filed on 10/3/2024)...
Entry #197 in Kadrey v. Meta Platforms, Inc., 3:23-cv-03417
ORDER by Magistrate Judge Thomas S. Hixson granting 191 Administrative Motion to File Under Seal. (rmm2, COURT STAFF) (Filed on 10/1/2024) (Entered: 10/01/2024)

Authors Guild v. OpenAI Inc.

Minute entry from 2024-10-01 in Authors Guild v. OpenAI Inc., 1:23-cv-08292
Discovery Hearing
Entry #214 in Authors Guild v. OpenAI Inc., 1:23-cv-08292
ORDER GRANTING ADMISSION OF JOHN R. LANHAM PRO HAC VICE granting (210) Motion for John Robert Lanham to Appear Pro Hac Vice in case 1:23-cv-08292-SHS-OTW. IT IS HEREBY ORDERED that Applicant admitt...
Entry #213 in Authors Guild v. OpenAI Inc., 1:23-cv-08292
STIPULATION AND ORDER REGARDING BASBANES LITIGATION. NOW, THEREFORE, in the interests of effectuating the Court's prior rulings, and adding further clarity and efficiency into the schedule, it...

Sancton v. OpenAI Inc. et al

[ Document 21]
ORDER granting #17 Motion for Alejandra Christina Salinas to Appear Pro Hac Vice (HEREBY ORDERED by Judge Sidney H. Stein)(Text Only Order) (lab)
2023-11-30 08:00:00
[ Document 20]
ORDER granting #14 Motion for Rohit Dwarka Nath to Appear Pro Hac Vice (HEREBY ORDERED by Judge Sidney H. Stein)(Text Only Order) (lab)
2023-11-30 08:00:00
[ Document 19]
ORDER granting #16 Motion for Justin Adatto Nelson to Appear Pro Hac Vice (HEREBY ORDERED by Judge Sidney H. Stein)(Text Only Order) (lab)
2023-11-30 08:00:00

Mata v. Avianca, Inc. (closed)

Note: this case is not about machine learning textually, but is included in this list because it is a notable example of gross misuse of a language model by plaintiff's counsel to submit falsified documents to the court. This led to censure of plaintiff's counsel and dismissal of the case.