Appendix B: AI & LLMs & data visualization
1 Using LLMs/foundationa models to built predictive models
1.1 Attention: Hallucination..
- Attention: Always cross-validate the information given by a LLM
- Why? Hallucination.. (see characterization statements on Wikipedia)
- “a tendency to invent facts in moments of uncertainty” (OpenAI, May 2023)
- “a model’s logical mistakes” (OpenAI, May 2023)
- fabricating information entirely, but behaving as if spouting facts (CNBC, May 2023)
- “making up information” (The Verge, February 2023)
- Why? Hallucination.. (see characterization statements on Wikipedia)
- Very good overview on Wikipedia
- Discussions in (Zhang2023-ok?), (Huang2023-zf?) and (Metz2023-qz?)
1.2 Avaible LLMs
- Closed-source
- ChatGPT X (OpenAI, ~Microsoft): https://chat.openai.com/
- Gemini (Google) https://gemini.google.com/
- Amazon Titan: https://aws.amazon.com/bedrock/titan/
- Open-source
- HuggingChat: https://huggingface.co/chat/
- Curated list of papers about large language models
- Top Open-Source LLMs for 2024 and Their Uses
1.3 Useful prompts
- LLMs can be used to generate code for data visualization
I have a dataset called "data" that includes the variable age. Please provide me with ggplot code to produce a histogram.
Please explain the code (add comments to the code).
I want to change the x-axis lables (angle 50%).
I can I encode data dimensions in a graph? What possibilities do I have?
How can I ideally visualize a linegraph where the two lines are perfectly overlapping each other but I want to visualize just that.