Repopack (now Repomix): Pack Your Entire Repository Into A Single File

Update: Due to legal considerations, this project has been renamed from “Repopack” to “Repomix”.

Repopack is a new open-source tool I’ve been testing out. It’s designed to help developers work more efficiently with LLM coding assistants by tackling a common headache: sharing an entire codebase with a large language model.

Most LLMs have strict token limits, making it tough to share large codebases or complex project structures. This often leads to a tedious process of manually copying and pasting relevant code snippets, which is both time-consuming and error-prone.

Another issue is the inconsistent formatting across different parts of a project, which can sometimes confuse LLMs. There’s also the ever-present worry of accidentally sharing sensitive information when selecting code to share.

Most importantly, when we only share isolated parts of a project with an LLM assistant, we rob it of the broader context. This can lead to suggestions that don’t quite fit with the overall architecture or project goals.

What is Repopack?

At its core, Repopack is a command-line tool that packages your entire code repository into a single file. This file is formatted in a way that’s easy for LLMs like GPT-4, Claude, or Gemini to process.

➜  trevorlasn.com git:(master) ✗ repopack --include "src"

📦 Repopack v0.1.43

✔ Packing completed successfully!

📈 Top 5 Files by Character Count and Token Count:
──────────────────────────────────────────────────────
1.  src/content/blog/benchmarks-for-node-bun-deno/index.md (65603 chars, 11330 tokens)
2.  src/content/blog/common-causes-of-memory-leaks-in-javascript/index.md (42041 chars, 11181 tokens)
3.  src/content/blog/google-journey-from-search-engine-to-tech-giant/index.mdx (24245 chars, 5025 tokens)
4.  src/content/blog/micro-frontends-what-they-are-and-when-to-use/index.mdx (19897 chars, 5958 tokens)
5.  src/content/blog/10-essential-terminal-commands-every-developer-should-know/index.md (18327 chars, 4892 tokens)

🔎 Security Check:
──────────────────
✔ No suspicious files detected.

📊 Pack Summary:
────────────────
  Total Files: 123
  Total Chars: 738930
 Total Tokens: 167841
       Output: repopack-output.txt
     Security: ✔ No suspicious files detected

🎉 All Done!
Your repository has been successfully packed.

Now if I want to share my project with an LLM, I can simply send it the repopack-output.txt file. This way, the model can see the entire project structure and context, making it easier to provide relevant suggestions. Here’s an excerpt of the file:

================================================================
Repository Structure
================================================================
src/
  components/
    ArrowCard.jsx
    BackToPrev.astro
    BackToTop.astro
    Container.astro
    ContentContainer.astro
    Footer.astro
    FormattedDate.astro
    Head.astro
    Header.astro
    InfiniteScrollPosts.jsx
    Link.astro
    SearchModal.astro
    ThemeSwitcher.tsx
  content/
    blog/
       explicit-is-better-than-implicit/
        index.mdx
      10-essential-terminal-commands-every-developer-should-know/
        index.md
      2020-programming-trend-predictions/
        index.md
      39-percent-companies-losing-control-of-it-and-security/
        index.mdx
      a-company-is-not-a-family-its-a-sports-team/
        index.mdx
      a-great-product-doesnt-need-marketing/
        index.md
      ageism-in-tech/
        index.mdx
      aggregate-error-in-javascript/
        index.md
      all-you-need-to-know-about-css-in-js/
        index.md
      amazon-rise-to-tech-titan/
        index.mdx
      amazons-no-weasel-words-rule/
        index.md
      astro-capo/
        index.mdx
      attracting-top-engineering-talent/
        index.md
      become-a-web-developer-in-180-days/
        index.md
      being-a-self-taught-developer/
        index.mdx
      benchmarks-for-node-bun-deno/
        index.md
      build-your-army/
        index.mdx
      cloudflare-ai-content-control/
        index.mdx
      code-wins-arguments/
        index.md
      common-causes-of-memory-leaks-in-javascript/
        index.md
      conways-Law/
        index.mdx
      csp-headers-astro/
        index.mdx
      culture-happens-outside-management/
        index.md
      demystifying-react-hooks/
        index.md
      dependency-time-machine/
        index.md
      docker-with-react/
        index.md
      easy-guide-for-webpack-2-0-from-scratch/
        index.md
      embrace-early-returns-and-intermediate-variables-for-readable-code/
        index.md
      engineering-managers-should-write-code/
        index.md
      eslint-plugin-depend/
        index.md
      evolve-or-become-irrelevant/
        index.md
      frontend-security-checklist/
        index.md
      google-chrome-built-in-gemini-nano/
        index.md
      google-is-killing-information-economics-on-the-internet/
        index.md
      google-journey-from-search-engine-to-tech-giant/
        index.mdx
      how-to-fetch-data-from-an-api-with-react-hooks/
        index.md
      how-to-launch-software-projects-on-time-and-on-budget/
        index.mdx
      how-to-restore-your-passion-for-programming/
        index.md
      how-to-use-redux-with-react-hooks/
        index.md
      increase-react-redux-application-performance-with-reselect-library/
        index.md
      internal-mobility/
        index.mdx
      invisible-columns-in-sql/
        index.md
      is-void-zero-a-threat-to-open-source/
        index.mdx
      its-more-fun-to-be-competent/
        index.md
      lazy-loading-iframes/
        index.md
      make-it-work-first-before-you-optimize/
        index.mdx
      mental-toughness-is-the-best-quality-a-developer-can-have/
        index.md
      mermaid-create-charts-and-diagrams-with-markdown/
        index.md
      micro-frontends-what-they-are-and-when-to-use/
        index.mdx
      minimum-viable-documentation/
        index.mdx
      next-js-react-server-side-rendering-done-right/
        index.md
      objective-c-is-the-ugliest-programming-language-and-a-total-abomination/
        index.md
      open-dyslexic-font/
        index.mdx
      outdated-docs-are-tech-debt/
        index.md
      peaks-js-interact-with-audio-waveforms/
        index.md
      preconnect-to-required-origins/
        index.md
      react-lazy-loading/
        index.md
      react-testing-mock-service-worker/
        index.md
      repopack/
        index.mdx
      setImmediate-vs-setTimeout-in-javascript/
        index.md
      sharp-high-performance-node-js-image-processing-library/
        index.md
      small-habits-big-impact/
        index.mdx
      software-engineer-titles-have-almost-lost-all-their-meaning/
        index.mdx
      specialist-vs-generalist-choosing-your-career-path/
        index.mdx
      speculation-rules-api/
        index.md
      start-with-the-bigger-picture/
        index.mdx
      staying-motivated-while-building/
        index.md
      take-your-writing-seriously/
        index.md
      technical-debt-is-killing-your-business/
        index.md
      the-art-of-effective-onboarding/
        index.mdx
      the-barnacle-strategy/
        index.mdx
      the-credit-vacuum/
        index.mdx
      the-crutch-effect/
        index.md
      the-internet-is-becoming-an-ocean-of-LLM-generated-junk/
        index.md
      the-only-javascript-feature-that-was-deprecated/
        index.md
      the-real-cost-of-meetings/
        index.md
      the-secret-to-being-a-top-developer-is-building-things/
        index.md
      the-what-why-and-how-of-using-a-skeleton-loading-screen/
        index.md
      tips-for-reducing-cyclomatic-complexity/
        index.md
      understanding-javascript-closures/
        index.md
      understanding-vue-suspense/
        index.md
      unrealistic-deadlines-in-software-engineering/
        index.md
      users-can-be-fired/
        index.md
      week-of-coding-can-save-you-hours-of-planning/
        index.md
      what-does-an-entry-level-programmer-need-to-know-exactly/
        index.md
      what-made-apple-great/
        index.mdx
      what-makes-mrbeast-so-successful/
        index.md
      whats-holding-you-back/
        index.mdx
      whats-new-in-express-5/
        index.mdx
      when-regex-goes-wrong/
        index.md
      when-should-you-actually-worry-about-tech-debt/
        index.md
      write-documentation-like-a-journalist/
        index.md
      your-repo-is-a-leaky-ship-probably/
        index.md
    config.ts
  layouts/
    PageLayout.astro
  lib/
    utils.ts
  pages/
    blog/
      [...slug].astro
    otto/
      index.astro
    topics/
      [topic].astro
      index.astro
    404.astro
    index.astro
    robots.txt.ts
    rss.xml.ts
  styles/
    global.css
  utils/
    filterPublishedPosts.ts
    isDraftPost.ts
  consts.ts
  env.d.ts
  types.ts

================================================================
Repository Files
================================================================

================
File: src/components/ArrowCard.jsx
================
import React from 'react';
import { readingTime } from "@lib/utils";

const ArrowCardReact = ({ entry }) => {
  {* yeah not sharing this, haha *}

Think of Repopack like a tar designed specifically for feeding codebases to LLMs. It’s tailored for the unique requirements of working with large language models in a coding context.
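To make the layout above concrete, here is a minimal TypeScript sketch that builds a similar file from an in-memory list of files. It is not how Repopack is implemented: `PackedFile` and `packFiles` are names I made up for illustration, the structure section is a flat list of paths rather than an indented tree, and the separator widths are only chosen to resemble the sample.

```typescript
// Illustrative only: mimics the general layout of repopack-output.txt.
// `PackedFile` and `packFiles` are hypothetical names, not Repopack's API.

interface PackedFile {
  path: string; // path relative to the repository root
  content: string; // full text of the file
}

// Separator widths picked to resemble the sample output (an assumption).
const LONG_RULE = "================================================================";
const SHORT_RULE = "================";

function packFiles(files: PackedFile[]): string {
  // Sort a copy by path so the output is deterministic.
  const sorted = files
    .slice()
    .sort((a, b) => (a.path < b.path ? -1 : a.path > b.path ? 1 : 0));

  const lines: string[] = [];

  // Section 1: an overview of what is in the pack (flat list in this sketch).
  lines.push(LONG_RULE, "Repository Structure", LONG_RULE);
  for (const file of sorted) {
    lines.push(file.path);
  }

  // Section 2: every file's content, each under its own labelled header.
  lines.push("", LONG_RULE, "Repository Files", LONG_RULE, "");
  for (const file of sorted) {
    lines.push(SHORT_RULE, "File: " + file.path, SHORT_RULE, file.content, "");
  }

  return lines.join("\n");
}
```

The point of the two sections is that the model sees the shape of the project first, then each file with an unambiguous `File:` label, so it can refer back to specific paths in its answers.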

Sharing the `repopack-output.txt` with large language models

Once you’ve generated your repopack-output.txt file, you can easily share it with various AI coding assistants. Here’s how it looks when used with two popular platforms:

Claude

ChatGPT

As you can see, both Claude and ChatGPT can easily process the repopack-output.txt file, allowing them to understand your entire codebase context when providing assistance.

Basic Usage

Getting started with Repopack is straightforward. Install it globally with npm:

npm install -g repopack

Run it in your project directory:

repopack

To pack specific files or directories using glob patterns:

repopack --include "src/**/*.ts,**/*.md"

Finally, find the repopack-output.txt file in your current directory.
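If glob patterns are new to you, the `--include` value above is a comma-separated list of them. The sketch below shows roughly how such a list could decide which paths get packed. It is a simplified matcher I wrote for illustration that only understands `*` and `**`; `globToRegExp` and `isIncluded` are hypothetical names, and Repopack's real matching may behave differently (for example with a bare directory like `src`).

```typescript
// Simplified glob matching for illustration; supports only `*` and `**`.
// This is an assumption-laden sketch, not the matcher Repopack uses.

function globToRegExp(glob: string): RegExp {
  let pattern = "";
  let i = 0;
  while (i < glob.length) {
    if (glob.slice(i, i + 3) === "**/") {
      pattern += "(?:.*/)?"; // zero or more whole directories
      i += 3;
    } else if (glob.slice(i, i + 2) === "**") {
      pattern += ".*"; // anything, including slashes
      i += 2;
    } else if (glob.charAt(i) === "*") {
      pattern += "[^/]*"; // anything within a single path segment
      i += 1;
    } else {
      // Literal character: escape it if it is a regex metacharacter.
      pattern += glob.charAt(i).replace(/[.+?^${}()|[\]\\]/g, "\\$&");
      i += 1;
    }
  }
  return new RegExp("^" + pattern + "$");
}

function isIncluded(filePath: string, includeOption: string): boolean {
  // Split the comma-separated option into individual patterns.
  const patterns = includeOption
    .split(",")
    .map((p) => p.trim())
    .filter((p) => p.length > 0);
  // A file is included if at least one pattern matches it.
  return patterns.some((p) => globToRegExp(p).test(filePath));
}

const include = "src/**/*.ts,**/*.md";
console.log(isIncluded("src/lib/utils.ts", include)); // true
console.log(isIncluded("README.md", include)); // true
console.log(isIncluded("src/components/ArrowCard.jsx", include)); // false
```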

Key Features

Simplicity: It’s a one-command operation to package your repo.
AI-Optimized: The output is formatted for easy consumption by AI models.
Token Awareness: It provides token counts, helping you stay within AI model limits.
Customizable: You can configure what to include or exclude.
Security-Minded: It respects .gitignore files and uses Secretlint to avoid exposing sensitive info.
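
The token counts are what make the summary actionable. As a back-of-the-envelope sketch, using the totals from the Pack Summary earlier in this post and an assumed 128,000-token context window (a placeholder, so check your model's documentation), you can see whether a pack is likely to fit and how much trimming the largest files would buy you. The function names are hypothetical helpers, not part of Repopack.

```typescript
// Numbers copied from the Pack Summary and Top 5 list earlier in this post.
const totalChars = 738930;
const totalTokens = 167841;
const topFiveTokens = [11330, 11181, 5025, 5958, 4892];

// Assumed context window and reply budget; both are placeholders.
const assumedContextWindow = 128000;
const reservedForReply = 4000;

// Average characters per token for this particular repository.
function charsPerToken(chars: number, tokens: number): number {
  return chars / tokens;
}

// True if the pack plus room for the model's reply fits in the window.
function fitsInContext(tokens: number, contextWindow: number, reserved: number): boolean {
  return tokens + reserved <= contextWindow;
}

console.log(charsPerToken(totalChars, totalTokens).toFixed(2)); // "4.40"
console.log(fitsInContext(totalTokens, assumedContextWindow, reservedForReply)); // false

// How much would excluding the five largest files save?
const savedTokens = topFiveTokens.reduce((sum, t) => sum + t, 0);
const remainingTokens = totalTokens - savedTokens;
console.log(savedTokens); // 38386
console.log(remainingTokens); // 129455
console.log(fitsInContext(remainingTokens, assumedContextWindow, reservedForReply)); // false
```

Under these assumptions, even dropping the five largest posts would not be enough, which is a hint that a narrower `--include` is the better lever for a content-heavy repo like this one.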

A Word of Caution

While Repopack is useful, it’s important to use it thoughtfully:

Security: Always double-check that sensitive information isn’t included in the output.
AI Limitations: Remember that AI tools, while powerful, aren’t infallible. Use their suggestions as a starting point, not gospel.
Context Matters: Sometimes, less is more. Consider if the AI really needs your entire codebase or just specific parts.
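
For the security point in particular, a quick scan of the packed text before you paste it anywhere is cheap insurance. The sketch below is a crude last-line check, not a replacement for Secretlint or for reading the file yourself. The three patterns and the name `findSuspiciousLines` are my own illustrative choices and will miss plenty of real secrets.

```typescript
// Crude pre-share check over the packed text. Patterns are illustrative
// assumptions, not the rules Secretlint or Repopack apply.

interface Finding {
  line: number; // 1-based line number in the packed text
  label: string; // which pattern matched
}

const suspiciousPatterns: { label: string; regex: RegExp }[] = [
  { label: "AWS access key ID", regex: /AKIA[0-9A-Z]{16}/ },
  { label: "private key block", regex: /-----BEGIN [A-Z ]*PRIVATE KEY-----/ },
  {
    label: "hard-coded secret assignment",
    regex: /(api[_-]?key|secret|password)\s*[:=]\s*["'][^"']{8,}["']/i,
  },
];

function findSuspiciousLines(packedText: string): Finding[] {
  const findings: Finding[] = [];
  packedText.split("\n").forEach((text, index) => {
    for (const { label, regex } of suspiciousPatterns) {
      // The regexes have no `g` flag, so `test` carries no state between calls.
      if (regex.test(text)) {
        findings.push({ line: index + 1, label });
      }
    }
  });
  return findings;
}
```

If `findSuspiciousLines` returns anything for the contents of repopack-output.txt, stop and look at those lines before sharing the file.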

Use it wisely, always be mindful of security concerns, and don’t forget that your expertise and judgment are still the most valuable assets in any development project.

References

https://github.com/yamadashy/repopack
