Latest revision as of 13:14, 21 May 2024

Herein lie some of my thoughts and resources about neural networks. Because I work for a company that builds models for computer vision, I have a bit of a professional bias towards image models, but I have tried to represent my knowledge/opinions about a broader range of subjects here.

What do you think about generative "AI"?

tl;dr - mostly dancing bearware, some novel uses in responsibility laundering

Resources

Image models

Stanford CS231n: Deep Learning for Computer Vision - excellent introductory course in computer vision (from kNN to VGGNet) focused on neural networks, with exercises done in Python (with numpy)
How to trick a neural network into thinking a panda is a vulture - excellent exploration by Julia Evans (with Python source code) of an adversarial attack on an image classifier
Multi-modal prompt injection image attacks against GPT-4V - "The fundamental problem here is this: Large Language Models are gullible...we need them to stay gullible. They’re useful because they follow our instructions. Trying to differentiate between “good” instructions and “bad” instructions is a very hard—currently intractable—problem." A very similar style of attack to one against the CLIP architecture that OpenAI themselves published.

Text models

For code

For everything else

Washington Post coverage of the data contained in the 'C4' dataset and how it influences the training of popular large models. Also allows users to check if arbitrary URLs are part of the dataset. (NOTE: C4 is not the only source of training text for the models being discussed, and the authors aren't doing a great job highlighting that, but it should still be pretty representative)
How well does ChatGPT speak Japanese? - an April 2023 evaluation of GPT-3.5 and GPT-4 performance on Japanese language assessments. Also includes an interesting comparison of the number of tokens required to represent the "Lord's Prayer" in multiple languages. I found the results of the latter particularly surprising.

Misc.

I gave a talk on the fundamentals of neural networks to Boston Python in March 2023
3blue1brown has an excellent series of lessons about the fundamentals of neural networks. Particularly interesting to me is the lesson on backpropagation for its excellent visualization of the process of adjusting neural network weights.

Dumping ground

These references are totally unclassified

"Large language models propagate race-based medicine"
"Normcore LLM Reads" - a reading list
Large Language Models Understand and Can be Enhanced by Emotional Stimuli - (Note: I consider the use of "Understand" here to be unprofessional and irresponsible, but it's an interesting paper)
The Data Provenance Initiative: A Large Scale Audit of Dataset Licensing & Attribution in AI
AnyDream: Secretive AI Platform Broke Stripe Rules to Rake in Money from Nonconsensual Pornographic Deepfakes
ChatGPT generates fake data set to support scientific hypothesis - "In a paper published in JAMA Ophthalmology on 9 November, the authors used GPT-4… The authors instructed the large language model to fabricate data to support the conclusion that [the surgical technique] DALK [deep anterior lamellar keratoplasty] results in better outcomes than PK [penetrating keratoplasty]."

Writings by others

Academic works

Using GitHub Copilot for Test Generation in Python: An Empirical Study - "we find that 45.28% of test generated...are passing tests, containing no syntax or runtime errors. The majority (54.72%) of generated tests...are failing, broken, or empty tests. We observe that tests generated within an existing test code context often mimic existing test methods"
Scalable Extraction of Training Data from (Production) Language Models - "Using only $200 USD worth of queries to ChatGPT (gpt-3.5-turbo), we are able to extract over 10,000 unique verbatim-memorized training examples. Our extrapolation to larger budgets (see below) suggests that dedicated adversaries could extract far more data…we estimate the…memorization of ChatGPT…[at] a gigabyte of training data. In practice we expect it is likely even higher."
Does GPT-4 Pass the Turing Test?
"The Fallacy of AI Functionality" - "...fear of misspecified objectives, runaway feedback loops, and AI alignment presumes the existence of an industry that can get AI systems to execute on any clearly declared objectives, and that the main challenge is to choose and design an appropriate goal. Needless to say, if one thinks the danger of AI is that it will work too well, it is a necessary precondition that it works at all."
"Adversarial Reprogramming of Neural Networks" - "In each [of six cases], we reprogrammed the [classification] network [trained on ImageNet] to perform three different adversarial tasks: counting squares, MNIST classification, and CIFAR-10 classification… Our finding…[suggests] that the reprogramming across domains is likely [possible]."
"Universal and Transferable Adversarial Attacks on Aligned Language Models" - "For Harmful Behaviors, our approach achieves an attack success rate of 100% on Vicuna-7B and 88% on Llama-2-7B-Chat… we find that the adversarial examples also transfer to Pythia, Falcon, Guanaco, and surprisingly, to GPT-3.5 (87.9%) and GPT-4 (53.6%), PaLM-2 (66%), and Claude-2 (2.1%)."
"Mathematical Capabilities of ChatGPT" - in which ChatGPT and GPT-4 largely fail to muster passing performance on a mathematical problem set, compared to a domain-specific model that achieves nearly 100% performance.
"Unmasking Clever Hans predictors and assessing what machines really learn" - "...it is important to comprehend the decision-making process itself...transparency of the what and why in a decision of a nonlinear machine becomes very effective for the essential task of judging whether the learned strategy is valid and generalizable or whether the model has based its decision on a spurious correlation in the training data"
"On the Dangers of Stochastic Parrots: Can Language Models Be Too Big? 🦜" - "LMs with extremely large numbers of parameters model their training data very closely and can be prompted to output specific information from that training data"
"Asleep at the Keyboard? Assessing the Security of GitHub Copilot's Code Contributions" - "In total, we produce 89 different scenarios for Copilot to complete, producing 1,689 programs. Of these, we found approximately 40% to be vulnerable."
"Do Users Write More Insecure Code with AI Assistants?" - "We observed that participants who had access to [codex-davinci-002] were more likely to introduce security vulnerabilities for the majority of programming tasks, yet also more likely to rate their insecure answers as secure compared to those in our control group."
"ChatGPT is fun, but it is not funny! Humor is still challenging Large Language Models" - "Over 90% of 1008 generated jokes were the same 25 Jokes."
"How is ChatGPT's behavior changing over time?" - "We find that the performance and behavior of both GPT-3.5 and GPT-4 can vary greatly over time."
"Are Emergent Abilities of Large Language Models a Mirage?" - "For a fixed task and a fixed model family, the researcher can choose a metric to create an emergent ability or choose a metric to ablate an emergent ability. Ergo, emergent abilities may be creations of the researcher’s choices, not a fundamental property of the model family on the specific task"
"Extracting Training Data from Large Language Models" - "We demonstrate our attack on GPT-2, a language model trained on scrapes of the public Internet, and are able to extract hundreds of verbatim text sequences from the model's training data...we find that larger models are more vulnerable than smaller models."
"Speak, Memory: An Archaeology of Books Known to ChatGPT/GPT-4" - "We find that these models have memorized books, both in the public domain and in copyright, and the capacity for memorization is tied to a book’s overall popularity on the web. This differential in memorization leads to differential in performance for downstream tasks, with better performance on popular books than on those not seen on the web"
"Who Answers It Better? An In-Depth Analysis of ChatGPT and Stack Overflow Answers to Software Engineering Questions" - "Our user study results show that users prefer ChatGPT answers 34.82% of the time. However, 77.27% of these preferences are incorrect answers"

Non-academic works

GPT-4o’s Chinese token-training data is polluted by spam and porn websites - 'glitch' tokens continue to plague models long after the phenomenon became well known to practitioners
tante's "Thoughts on “generative AI Art”" - "…people using these [generative] systems don’t care about the…process of creation or the thought that went into it, they care about the output and what they feel that that output gives them…It’s “idea guy” heaven."
Lindsey Kuper's CSE232 syllabus section on LLM usage - "Aside from the fact that the resounding hollowness of the ChatGPT-produced prose has sucked away all of my zest for life…please understand that while you are welcome to use LLM-based tools in this course, you should be aware of their limitations."
Time: "OpenAI Used Kenyan Workers on Less Than $2 Per Hour to Make ChatGPT Less Toxic" - the human labor that powers ChatGPT's reinforcement learning from human feedback (RLHF)
Donald Knuth: correspondence with Stephen Wolfram - "I myself shall certainly continue to leave such research to others, and to devote my time to developing concepts that are authentic and trustworthy. And I hope you do the same."
Douglas Hofstadter: "Gödel, Escher, Bach, and AI" - "I frankly am baffled by the allure, for so many unquestionably insightful people...of letting opaque computational systems perform intellectual tasks for them."
Ted Chiang: "ChatGPT Is a Blurry JPEG of the Web" - "Large language models identify statistical regularities in text...When we’re dealing with sequences of words, lossy compression looks smarter than lossless compression."
Ted Chiang: "Will A.I. Become the New McKinsey?" - "I’m not very convinced by claims that A.I. poses a danger to humanity because it might develop goals of its own and prevent us from turning it off. However, I do think that A.I. is dangerous inasmuch as it increases the power of capitalism."
Bruce Schneier: "AI and Trust" - "the corporations controlling AI systems will take advantage of our confusion to take advantage of us…our fears of AI are basically fears of capitalism"

Lawsuits

The legal status of generative models and their implications for intellectual property in the US is something I'm trying to keep an eye on. The cases given below are of particular interest to me.

The New York Times Company v. Microsoft Corporation

Entry #771 in The New York Times Company v. Microsoft Corporation, 1:23-cv-11195: Proposed Stipulation and Order
Entry #770 in The New York Times Company v. Microsoft Corporation, 1:23-cv-11195: ORDER in case 1:23-cv-08292-SHS-OTW; granting (192) Motion for Sarah Vandervalk to Appear Pro Hac Vice in case 1:25-md-03143-SHS-OTW (HEREBY ORDERED by Judge Sidney H. Stein)(Text Only Order) Filed...
Minute entry from 2025-07-24 in The New York Times Company v. Microsoft Corporation, 1:23-cv-11195: Order on Motion to Appear Pro Hac Vice

Andersen v. Stability AI Ltd.

January 2023 coverage: initial complaint
Latest case proceedings:

Entry #323 in Andersen v. Stability AI Ltd., 3:23-cv-00201: TRANSCRIPT ORDER for proceedings held on 7/15/2025 before Magistrate Judge Lisa J. Cisneros by Sarah Andersen, Gregory Manchess, Adam Ellis, Gerald Brom, Grzegorz Rutkowski, Julia Kaye, Jingna Zhan...
Entry #322 in Andersen v. Stability AI Ltd., 3:23-cv-00201: TRANSCRIPT ORDER for proceedings held on 5/6/2025 before Judge William H. Orrick by Sarah Andersen, Gregory Manchess, Adam Ellis, Gerald Brom, Grzegorz Rutkowski, Julia Kaye, Jingna Zhang, Karla Or...
Entry #321 in Andersen v. Stability AI Ltd., 3:23-cv-00201: TRANSCRIPT ORDER for proceedings held on June 10, 2025 before Magistrate Judge Lisa J. Cisneros by Runway AI, Inc., for Recorded Proceeding - San Francisco. (Greenberg, Julia) (Filed on 7/18/2025)...

Getty Images (US), Inc. v. Stability AI, Inc.

Entry #68 in Getty Images (US), Inc. v. Stability AI, Inc., 1:23-cv-00135: NOTICE requesting Clerk to remove Melissa Rutman as co-counsel. Reason for request: no longer with Weil, Gotshal & Manges LLP. (Vrana, Robert) (Entered: 05/02/2025)
Entry #67 in Getty Images (US), Inc. v. Stability AI, Inc., 1:23-cv-00135: NOTICE requesting Clerk to remove Laura Gilbert Remus as co-counsel. Reason for request: no longer with the firm. (Flynn, Michael) (Entered: 04/11/2025)
Entry #66 in Getty Images (US), Inc. v. Stability AI, Inc., 1:23-cv-00135: Letter to The Honorable Jennifer L. Hall from Robert M. Vrana regarding Rule 26(f) conference - re 52 Status Report. (Vrana, Robert) (Entered: 11/25/2024)

Doe 1 v. GitHub, Inc.

Minute entry from 2025-02-11 in DOE 1 v. GitHub, Inc., 4:22-cv-06823: Notice of Appearance/Substitution/Change/Withdrawal of Attorney
Entry #289 in DOE 1 v. GitHub, Inc., 4:22-cv-06823: NOTICE of Withdrawal filed by Vera Ranieri, no longer appearing on behalf of OpenAI Startup Fund Management, LLC, OpenAI OpCo, L.L.C., OpenAI, Inc., OPENAI, L.L.C., OPENAI GLOBAL, LLC, OAI CORPORAT...
Entry #288 in DOE 1 v. GitHub, Inc., 4:22-cv-06823: Transcript Designation Form for proceedings held on 5/4/2023 and 11/9/2023 before Judge Jon S. Tigar, (Saveri, Joseph) (Filed on 1/10/2025) (Entered: 01/10/2025)

Silverman v. OpenAI, Inc.

July 2023 coverage: initial complaint
Latest case proceedings:

Minute entry from 2025-04-28 in Silverman v. OpenAI, Inc., 3:23-cv-03416: Case opened in Southern District of New York as 1:25-cv-03483, filed 04/27/2025. (far, COURT STAFF) (Filed on 4/28/2025)
Minute entry from 2025-04-28 in Silverman v. OpenAI, Inc., 3:23-cv-03416: Remark
Entry #72 in Silverman v. OpenAI, Inc., 3:23-cv-03416: MDL TRANSFER ORDER transferring case to the Southern District of New York re MDL No. 3143. (far, COURT STAFF) (Filed on 4/21/2025) (Entered: 04/22/2025)

Kadrey v. Meta Platforms, Inc.

Similar suit to Silverman v. OpenAI, same parties etc.
Notable for a prominent dismissal of the class-action nature of the case, as the blatantly copied copyrighted works in the training data are not the works of the plaintiffs.
Latest case proceedings:

Entry #613 in Kadrey v. Meta Platforms, Inc., 3:23-cv-03417: STIPULATION WITH PROPOSED ORDER Regarding Case Schedule Proposals filed by Jacqueline Woodson, Richard Kadrey, Andrew Sean Greer, Rachel Louise Snyder, David Henry Hwang, Ta-Nehisi Coates, Laura Li...
Entry #612 in Kadrey v. Meta Platforms, Inc., 3:23-cv-03417: NOTICE by Meta Platforms, Inc. re 606 Order, of Compliance and Response to Order (Attachments: # 1 Exhibit 1 - Redacted Ex. 42, # 2 Exhibit 2 - Redacted Ex. 43, # 3 Exhibit 3 - Redacted Ex. 45, # 4...
Entry #611 in Kadrey v. Meta Platforms, Inc., 3:23-cv-03417: TRANSCRIPT ORDER for proceedings held on 07/11/2025 before Judge Vince Chhabria by Jacqueline Woodson, Richard Kadrey, Andrew Sean Greer, Rachel Louise Snyder, David Henry Hwang, Ta-Nehisi Coates,...

Authors Guild v. OpenAI Inc.

Entry #550 in Authors Guild v. OpenAI Inc., 1:23-cv-08292: Proposed Stipulation and Order
Minute entry from 2025-07-24 in Authors Guild v. OpenAI Inc., 1:23-cv-08292: Order on Motion to Appear Pro Hac Vice
Minute entry from 2025-07-23 in Authors Guild v. OpenAI Inc., 1:23-cv-08292: Order on Motion to Appear Pro Hac Vice

Sancton v. OpenAI Inc. et al


Mata v. Avianca, Inc. (closed)

Note: this case is not itself about machine learning, but it is included in this list because it is a notable example of gross misuse of a language model by plaintiff's counsel to submit falsified documents to the court. This led to censure of plaintiff's counsel and dismissal of the case.

@@ Line 1: / Line 1: @@
-Herein lie some of my thoughts and resources about neural networks. Because I work for a company that builds models for computer vision, I have a bit of a professional bias towards [[#image models|image models]], but I have tried to represent my knowledge/opinions about a broader range of subjects here.
+Herein lie some of my thoughts and resources about neural networks. Because I work for a company that builds models for computer vision, I have a bit of a professional bias towards [[#Image models|image models]], but I have tried to represent my knowledge/opinions about a broader range of subjects here.
 = What do you think about generative "AI"? =
@@ Line 26: / Line 26: @@
 These references are totally unclassified
+* [https://www.nature.com/articles/s41746-023-00939-z "Large language models propagate race-based medicine"]
 * [https://gist.github.com/veekaybee/be375ab33085102f9027853128dc5f0e "Normcore LLM Reads"] - a reading list
 * [https://arxiv.org/abs/2307.11760 Large Language Models Understand and Can be Enhanced by Emotional Stimuli] - (Note: I consider the use of "Understand" here to be unprofessional and irresponsible, but it's an interesting paper)
@@ Line 34: / Line 35: @@
 === Writings by others ===
 ==== Academic works ====
+* [https://conf.researchr.org/details/ast-2024/ast-2024-papers/2/Using-GitHub-Copilot-for-Test-Generation-in-Python-An-Empirical-Study Using GitHub Copilot for Test Generation in Python: An Empirical Study] - "''we find that 45.28% of test generated...are passing tests, containing no syntax or runtime errors. The majority (54.72%) of generated tests...are failing, broken, or empty tests. We observe that tests generated within an existing test code context often mimic existing test methods''"
 * [https://arxiv.org/abs/2311.17035 Scalable Extraction of Training Data from (Production) Language Models] - "''Using only $200 USD worth of queries to ChatGPT (gpt-3.5-turbo), we are able to extract over 10,000 unique verbatim-memorized training examples. Our extrapolation to larger budgets (see below) suggests that dedicated adversaries could extract far more data…we estimate the…memorization of ChatGPT…[at] a gigabyte of training data. In practice we expect it is likely even higher.''"
 * [https://arxiv.org/abs/2310.20216 Does GPT-4 Pass the Turing Test?]
@@ Line 52: / Line 54: @@
 ==== Non-academic works ====
+* [https://www.technologyreview.com/2024/05/17/1092649/gpt-4o-chinese-token-polluted/ GPT-4o’s Chinese token-training data is polluted by spam and porn websites] - 'glitch' tokens continue to plague models long after [https://www.alignmentforum.org/posts/aPeJE8bSo6rAFoLqg/solidgoldmagikarp-plus-prompt-generation the phenomenon] is well-known to practicioners
+* [https://tante.cc/2023/11/10/thoughts-on-generative-ai-art/ tante's "Thoughts on “generative AI Art”"] - "''…people using these [generative] systems don’t care about the…process of creation or the thought that went into it, they care about the output and what they feel that that output gives them…It’s “idea guy” heaven.''"
 * [http://decomposition.al/CSE232-2023-09/course-overview.html#policy-on-the-use-of-llm-based-tools-like-chatgpt Lindsey Kuper's CSE232 syllabus section on LLM usage] - "''Aside from the fact that the resounding hollowness of the ChatGPT-produced prose has sucked away all of my zest for life…please understand that while you are welcome to use LLM-based tools in this course, you should be aware of their limitations.''"
 * [https://time.com/6247678/openai-chatgpt-kenya-workers/ Time: "OpenAI Used Kenyan Workers on Less Than $2 Per Hour to Make ChatGPT Less Toxic"]
@@ Line 59: / Line 63: @@
 * [https://www.newyorker.com/tech/annals-of-technology/chatgpt-is-a-blurry-jpeg-of-the-web Ted Chiang: "ChatGPT Is a Blurry JPEG of the Web"] - "''Large language models identify statistical regularities in text...When we’re dealing with sequences of words, lossy compression looks smarter than lossless compression.''"
 * [https://www.newyorker.com/science/annals-of-artificial-intelligence/will-ai-become-the-new-mckinsey Ted Chiang: "Will A.I. Become the New McKinsey?"] - "''I’m not very convinced by claims that A.I. poses a danger to humanity because it might develop goals of its own and prevent us from turning it off. However, I do think that A.I. is dangerous inasmuch as it increases the power of capitalism.''"
+* [https://www.schneier.com/blog/archives/2023/12/ai-and-trust.html Bruce Schneier: "AI and Trust"] - ''"the corporations controlling AI systems will take advantage of our confusion to take advantage of us…our fears of AI are basically fears of capitalism"''
 = Lawsuits =
 The legal status of generative models and their implications for intellectual property in the US is something I'm trying to keep an eye on. The cases given below are of particular interest to me.
+==== The New York Times Company v. MICROSOFT CORPORATION ====
+* [https://www.nytimes.com/2023/12/27/business/media/new-york-times-open-ai-microsoft-lawsuit.html December 2023 coverage: initial complaint]
+* Latest [https://www.courtlistener.com/docket/68117049/the-new-york-times-company-v-microsoft-corporation/ case proceedings]:
+<rss max=3>https://www.courtlistener.com/docket/68117049/feed/</rss>
 ==== Andersen v. Stability AI Ltd. ====
@@ Line 83: / Line 93: @@
 * Latest [https://www.courtlistener.com/docket/67569254/silverman-v-openai-inc/ case proceedings]:
 <rss max=3>https://www.courtlistener.com/docket/67569254/feed/</rss>
+==== Kadrey v. Meta Platforms, Inc. ====
+* Similar suit to Silverman v. OpenAI, same parties etc.
+* Notable for a [https://storage.courtlistener.com/recap/gov.uscourts.cand.415175/gov.uscourts.cand.415175.56.0_1.pdf prominent dismissal] of the class-action nature of the case, as the blatantly copied copyrighted works in the training data are not the works of the plaintiffs.
+* Latest [https://www.courtlistener.com/docket/67569326/kadrey-v-meta-platforms-inc/]:
+<rss max=3>https://www.courtlistener.com/docket/67569326/feed/</rss>
 ==== Authors Guild v. OpenAI Inc. ====

Difference between revisions of "Neural Networks"