- Infrastructure/Platform
- Data Engineering
- Pre-Training Stage
- Fine-Tuning Stage
- Alignment Stage
- Evaluation Stage
- Inference Stage
- Applications
I'm grateful to my employers for trusting me to lead the team that built our GPU supercompute platform and infrastructure, and to co-lead the LLM pre-training team. This gave me the opportunity to work on large on-premise GPU clusters, first with A100s and then with H100s, which is certainly a privilege. I hope sharing some of these notes and insights helps the community.