hgbook: 7d7ddc3a57af en/tour-basic.tex

hgbook

view en/tour-basic.tex @ 96:7d7ddc3a57af

Split tour into two chapters.

author	Bryan O'Sullivan <bos@serpentine.com>
date	Fri Oct 13 15:34:54 2006 -0700 (2006-10-13)
parents	47ea206351d5
children	659fa1a2c628

line source

1 \chapter{A tour of Mercurial: the basics}

2 \label{chap:tour-basic}

4 \section{Installing Mercurial on your system}

5 \label{sec:tour:install}

7 Prebuilt binary packages of Mercurial are available for every popular

8 operating system. These make it easy to start using Mercurial on your

9 computer immediately.

11 \subsection{Linux}

13 Because each Linux distribution has its own packaging tools, policies,

14 and rate of development, it's difficult to give a comprehensive set of

15 instructions on how to install Mercurial binaries. The version of

16 Mercurial that you will end up with can vary depending on how active

17 the person is who maintains the package for your distribution.

19 To keep things simple, I will focus on installing Mercurial from the

20 command line under the most popular Linux distributions. Most of

21 these distributions provide graphical package managers that will let

22 you install Mercurial with a single click; the package name to look

23 for is \texttt{mercurial}.

25 \begin{itemize}

26 \item[Debian]

27 \begin{codesample4}

28 apt-get install mercurial

29 \end{codesample4}

31 \item[Fedora Core]

32 \begin{codesample4}

33 yum install mercurial

34 \end{codesample4}

36 \item[Gentoo]

37 \begin{codesample4}

38 emerge mercurial

39 \end{codesample4}

41 \item[OpenSUSE]

42 \begin{codesample4}

43 yum install mercurial

44 \end{codesample4}

46 \item[Ubuntu] Ubuntu's Mercurial package is particularly old, and you

47 should not use it. If you know how, you can rebuild and install the

48 Debian package. It's probably easier to build Mercurial from source

49 and simply run that; see section~\ref{sec:srcinstall:unixlike} for

50 details.

51 \end{itemize}

53 \subsection{Mac OS X}

55 Lee Cantey publishes an installer of Mercurial for Mac OS~X at

56 \url{http://mercurial.berkwood.com}. This package works on both

57 Intel-~and Power-based Macs. Before you can use it, you must install

58 a compatible version of Universal MacPython~\cite{web:macpython}. This

59 is easy to do; simply follow the instructions on Lee's site.

61 \subsection{Solaris}

63 XXX.

65 \subsection{Windows}

67 Lee Cantey publishes an installer of Mercurial for Windows at

68 \url{http://mercurial.berkwood.com}. This package has no external

69 dependencies; it ``just works''.

71 \begin{note}

72 The Windows version of Mercurial does not automatically convert line

73 endings between Windows and Unix styles. If you want to share work

74 with Unix users, you must do a little additional configuration

75 work. XXX Flesh this out.

76 \end{note}

78 \section{Getting started}

80 To begin, we'll use the \hgcmd{version} command to find out whether

81 Mercurial is actually installed properly. The actual version

82 information that it prints isn't so important; it's whether it prints

83 anything at all that we care about.

84 \interaction{tour.version}

86 \subsection{Built-in help}

88 Mercurial provides a built-in help system. This invaluable for those

89 times when you find yourself stuck trying to remember how to run a

90 command. If you are completely stuck, simply run \hgcmd{help}; it

91 will print a brief list of commands, along with a description of what

92 each does. If you ask for help on a specific command (as below), it

93 prints more detailed information.

94 \interaction{tour.help}

95 For a more impressive level of detail (which you won't usually need)

96 run \hgcmdargs{help}{\hggopt{-v}}. The \hggopt{-v} option is short

97 for \hggopt{--verbose}, and tells Mercurial to print more information

98 than it usually would.

100 \section{Working with a repository}

101

102 In Mercurial, everything happens inside a \emph{repository}. The

103 repository for a project contains all of the files that ``belong to''

104 that project, along with a historical record of the project's files.

105

106 There's nothing particularly magical about a repository; it is simply

107 a directory tree in your filesystem that Mercurial treats as special.

108 You can rename delete a repository any time you like, using either the

109 command line or your file browser.

110

111 \subsection{Making a local copy of a repository}

112

113 \emph{Copying} a repository is just a little bit special. While you

114 could use a normal file copying command to make a copy of a

115 repository, it's best to use a built-in command that Mercurial

116 provides. This command is called \hgcmd{clone}, because it creates an

117 identical copy of an existing repository.

118 \interaction{tour.clone}

119 If our clone succeeded, we should now have a local directory called

120 \dirname{hello}. This directory will contain some files.

121 \interaction{tour.ls}

122 These files have the same contents and history in our repository as

123 they do in the repository we cloned.

124

125 Every Mercurial repository is complete, self-contained, and

126 independent. It contains its own private copy of a project's files

127 and history. A cloned repository remembers the location of the

128 repository it was cloned from, but it does not communicate with that

129 repository, or any other, unless you tell it to.

130

131 What this means for now is that we're free to experiment with our

132 repository, safe in the knowledge that it's a private ``sandbox'' that

133 won't affect anyone else.

134

135 \subsection{What's in a repository?}

136

137 When we take a more detailed look inside a repository, we can see that

138 it contains a directory named \dirname{.hg}. This is where Mercurial

139 keeps all of its metadata for the repository.

140 \interaction{tour.ls-a}

141

142 The contents of the \dirname{.hg} directory and its subdirectories are

143 private to Mercurial. Every other file and directory in the

144 repository is yours to do with as you please.

145

146 To introduce a little terminology, the \dirname{.hg} directory is the

147 ``real'' repository, and all of the files and directories that coexist

148 with it are said to live in the \emph{working directory}. An easy way

149 to remember the distinction is that the \emph{repository} contains the

150 \emph{history} of your project, while the \emph{working directory}

151 contains a \emph{snapshot} of your project at a particular point in

152 history.

153

154 \section{A tour through history}

155

156 One of the first things we might want to do with a new, unfamiliar

157 repository is understand its history. The \hgcmd{log} command gives

158 us a view of history.

159 \interaction{tour.log}

160 By default, this command prints a brief paragraph of output for each

161 change to the project that was recorded. In Mercurial terminology, we

162 call each of these recorded events a \emph{changeset}, because it can

163 contain a record of changes to several files.

164

165 The fields in a record of output from \hgcmd{log} are as follows.

166 \begin{itemize}

167 \item[\texttt{changeset}] This field has the format of a number,

168 followed by a colon, followed by a hexadecimal string. These are

169 \emph{identifiers} for the changeset. There are two identifiers

170 because the number is shorter and easier to type than the hex

171 string.

172 \item[\texttt{user}] The identity of the person who created the

173 changeset. This is a free-form field, but it most often contains a

174 person's name and email address.

175 \item[\texttt{date}] The date and time on which the changeset was

176 created, and the timezone in which it was created. (Thef date and

177 time are local to that timezone; they display what time and date it

178 was for the person who created the changeset.)

179 \item[\texttt{summary}] The first line of the text message that the

180 creator of the changeset entered to describe the changeset.

181 \end{itemize}

182 The default output printed by \hgcmd{log} is purely a summary; it is

183 missing a lot of detail.

184

185 \begin{figure}[ht]

186 \centering

187 \grafix{tour-history}

188 \caption{Graphical history of the \dirname{hello} repository}

189 \label{fig:tour:history}

190 \end{figure}

191

192 \subsection{Changesets, revisions, and identification}

193

194 English being a notoriously sloppy language, we have a variety of

195 terms that have the same meaning. If you are talking about Mercurial

196 history with other people, you will find that the word ``changeset''

197 is often compressed to ``change'' or ``cset'', and sometimes a

198 changeset is referred to as a ``revision'' or a ``rev''.

199

200 While it doesn't matter what \emph{word} you use to refer to the

201 concept of ``a~changeset'', the \emph{identifier} that you use to

202 refer to ``a~\emph{specific} changeset'' is of great importance.

203 Recall that the \texttt{changeset} field in the output from

204 \hgcmd{log} identifies a changeset using both a number and a

205 hexadecimal string. The number is \emph{only valid in that

206 repository}, while the hex string is the \emph{permanent, unchanging

207 identifier} that will always identify that changeset in every copy

208 of the repository.

209

210 This distinction is important. If you send someone an email talking

211 about ``revision~33'', there's a high likelihood that their

212 revision~33 will \emph{not be the same} as yours. The reason for this

213 is that a revision number depends on the order in which changes

214 arrived in a repository, and there is no guarantee that the same

215 changes will happen in the same order in different repositories.

216 Three changes $a,b,c$ can easily appear in one repository as $0,1,2$,

217 while in another as $1,0,2$.

218

219 Mercurial uses revision numbers purely as a convenient shorthand. If

220 you need to discuss a changeset with someone, or make a record of a

221 changeset for some other reason (for example, in a bug report), use

222 the hexadecimal identifier.

223

224 \subsection{Viewing specific revisions}

225

226 To narrow the output of \hgcmd{log} down to a single revision, use the

227 \hgopt{log}{-r} (or \hgopt{log}{--rev}) option. You can use either a

228 revision number or a long-form changeset identifier, and you can

229 provide as many revisions as you want. \interaction{tour.log-r}

230

231 If you want to see the history of several revisions without having to

232 list each one, you can use \emph{range notation}; this lets you

233 express the idea ``I want all revisions between $a$ and $b$,

234 inclusive''.

235 \interaction{tour.log.range}

236 Mercurial also honours the order in which you specify revisions, so

237 \hgcmdargs{log}{-r 2:4} prints $2,3,4$ while \hgcmdargs{log}{-r 4:2}

238 prints $4,3,2$.

239

240 \subsection{More detailed information}

241

242 While the summary information printed by \hgcmd{log} is useful if you

243 already know what you're looking for, you may need to see a complete

244 description of the change, or a list of the files changed, if you're

245 trying to decide whether a changeset is the one you're looking for.

246 The \hgcmd{log} command's \hggopt{-v} (or \hggopt{--verbose})

247 option gives you this extra detail.

248 \interaction{tour.log-v}

249

250 If you want to see both the description and content of a change, add

251 the \hgopt{log}{-p} (or \hgopt{log}{--patch}) option. This displays

252 the content of a change as a \emph{unified diff} (if you've never seen

253 a unified diff before, see section~\ref{sec:mq:patch} for an overview).

254 \interaction{tour.log-vp}

255

256 \section{All about command options}

257

258 Let's take a brief break from exploring Mercurial commands to discuss

259 a pattern in the way that they work; you may find this useful to keep

260 in mind as we continiue our tour.

261

262 Mercurial has a consistent and straightforward approach to dealing

263 with the options that you can pass to commands. It follows the

264 conventions for options that are common to modern Linux and Unix

265 systems.

266 \begin{itemize}

267 \item Every option has a long name. For example, as we've already

268 seen, the \hgcmd{log} command accepts a \hgopt{log}{--rev} option.

269 \item Most options have short names, too. Instead of

270 \hgopt{log}{--rev}, we can use \hgopt{log}{-r}. (The reason that

271 some options don't have short names is that the options in question

272 are rarely used.)

273 \item Long options start with two dashes (e.g.~\hgopt{log}{--rev}),

274 while short options start with one (e.g.~\hgopt{log}{-r}).

275 \item Option naming and usage is consistent across commands. For

276 example, every command that lets you specify a changeset~ID or

277 revision number accepts both \hgopt{log}{-r} and \hgopt{log}{--rev}

278 arguments.

279 \end{itemize}

280 In the examples throughout this book, I use short options instead of

281 long. This just reflects my own preference, so don't read anything

282 significant into it.

283

284 Most commands that print output of some kind will print more output

285 when passed a \hggopt{-v} (or \hggopt{--verbose}) option, and less

286 when passed \hggopt{-q} (or \hggopt{--quiet}).

287

288 \section{Making and reviewing changes}

289

290 Now that we have a grasp of viewing history in Mercurial, let's take a

291 look at making some changes and examining them.

292

293 The first thing we'll do is isolate our experiment in a repository of

294 its own. We use the \hgcmd{clone} command, but we don't need to

295 clone a copy of the remote repository. Since we already have a copy

296 of it locally, we can just clone that instead. This is much faster

297 than cloning over the network, and cloning a local repository uses

298 less disk space in most cases, too.

299 \interaction{tour.reclone}

300 As an aside, it's often good practice to keep a ``pristine'' copy of a

301 remote repository around, which you can then make temporary clones of

302 to create sandboxes for each task you want to work on. This lets you

303 work on multiple tasks in parallel, each isolated from the others

304 until it's complete and you're ready to integrate it back. Because

305 local clones are so cheap, there's almost no overhead to cloning and

306 destroying repositories whenever you want.

307

308 In our \dirname{my-hello} repository, we have a file

309 \filename{hello.c} that contains the classic ``hello, world'' program.

310 Let's use the ancient and venerable \command{sed} command to edit this

311 file so that it prints a second line of output. (I'm only using

312 \command{sed} to do this because it's easy to write a scripted example

313 this way. Since you're not under the same constraint, you probably

314 won't want to use \command{sed}; simply use your preferred text editor to

315 do the same thing.)

316 \interaction{tour.sed}

317

318 Mercurial's \hgcmd{status} command will tell us what Mercurial knows

319 about the files in the repository.

320 \interaction{tour.status}

321 The \hgcmd{status} command prints no output for some files, but a line

322 starting with ``\texttt{M}'' for \filename{hello.c}. Unless you tell

323 it to, \hgcmd{status} will not print any output for files that have

324 not been modified.

325

326 The ``\texttt{M}'' indicates that Mercurial has noticed that we

327 modified \filename{hello.c}. Notice that we didn't need to

328 \emph{inform} Mercurial that we were going to modify the file before

329 we started, or that we had modified the file after we were done; it

330 was able to figure this out itself.

331

332 It's a little bit helpful to know that we've modified

333 \filename{hello.c}, but we might prefer to know exactly \emph{what}

334 changes we've made to it. To do this, we use the \hgcmd{diff}

335 command.

336 \interaction{tour.diff}

337

338 \section{Recording changes in a new changeset}

339

340 We can modify files, build and test our changes, and use

341 \hgcmd{status} and \hgcmd{diff} to review our changes, until we're

342 satisfied with what we've done and arrive at a natural stopping point

343 where we want to record our work in a new changeset.

344

345 The \hgcmd{commit} command lets us create a new changeset; we'll

346 usually refer to this as ``making a commit'' or ``committing''.

347

348 \subsection{Writing a commit message}

349

350 When we commit a change, Mercurial drops us into a text editor, to

351 enter a message that will describe the modifications we've made in

352 this changeset. This is called the \emph{commit message}. It will be

353 a record for readers of what we did and why, and it will be printed by

354 \hgcmd{log} after we've finished committing.

355 \interaction{tour.commit}

356

357 The editor that the \hgcmd{commit} command drops us into will contain

358 an empty line, followed by a number of lines starting with

359 ``\texttt{HG:}''.

360 \begin{codesample2}

361 \emph{empty line}

362 HG: changed hello.c

363 \end{codesample2}

364 Mercurial ignores the lines that start with ``\texttt{HG:}''; it uses

365 them only to tell us which files it's recording changes to. Modifying

366 or deleting these lines has no effect.

367

368 \subsection{Writing a good commit message}

369

370 Since \hgcmd{log} only prints the first line of a commit message by

371 default, it's best to write a commit message whose first line stands

372 alone. Here's a real example of a commit message that \emph{doesn't}

373 follow this guideline, and hence has a summary that is not readable.

374 \begin{codesample2}

375 changeset: 73:584af0e231be

376 user: Censored Person <censored.person@example.org>

377 date: Tue Sep 26 21:37:07 2006 -0700

378 summary: include buildmeister/commondefs. Add an exports and install

379 \end{codesample2}

380

381 As far as the remainder of the contents of the commit message are

382 concerned, there are no hard-and-fast rules. Mercurial itself doesn't

383 interpret or care about the contents of the commit message, though

384 your project may have policies that dictate a certain kind of

385 formatting.

386

387 My personal preference is for short, but informative, commit messages

388 that tell me something that I can't figure out with a quick glance at

389 the output of \hgcmdargs{log}{--patch}.

390

391 \subsection{Aborting a commit}

392

393 If you decide that you don't want to commit while in the middle of

394 editing a commit message, simply exit from your editor without saving

395 the file that it's editing. This will cause nothing to happen to

396 either the repository or the working directory.

397

398 If we run the \hgcmd{commit} command without any arguments, it records

399 all of the changes we've made, as reported by \hgcmd{status} and

400 \hgcmd{diff}.

401

402 \subsection{Admiring our new handywork}

403

404 Once we've finished the commit, we can use the \hgcmd{tip} command to

405 display the changeset we just created. This command produces output

406 that is identical to \hgcmd{log}, but it only displays the newest

407 revision in the repository.

408 \interaction{tour.tip}

409 We refer to the newest revision in the repository as the tip revision,

410 or simply the tip.

411

412 \section{Sharing changes}

413

414 We mentioned earlier that repositories in Mercurial are

415 self-contained. This means that the changeset we just created exists

416 only in our \dirname{my-hello} repository. Let's look at a few ways

417 that we can propagate this change into other repositories.

418

419 \subsection{Pulling changes from another repository}

420 \label{sec:tour:pull}

421

422 To get started, let's clone our original \dirname{hello} repository,

423 which does not contain the change we just committed. We'll call our

424 temporary repository \dirname{hello-pull}.

425 \interaction{tour.clone-pull}

426

427 We'll use the \hgcmd{pull} command to bring changes from

428 \dirname{my-hello} into \dirname{hello-pull}. However, blindly

429 pulling unknown changes into a repository is a somewhat scary

430 prospect. Mercurial provides the \hgcmd{incoming} command to tell us

431 what changes the \hgcmd{pull} command \emph{would} pull into the

432 repository, without actually pulling the changes in.

433 \interaction{tour.incoming}

434 (Of course, someone could cause more changesets to appear in the

435 repository that we ran \hgcmd{incoming} in, before we get a chance to

436 \hgcmd{pull} the changes, so that we could end up pulling changes that we

437 didn't expect.)

438

439 Bringing changes into a repository is a simple matter of running the

440 \hgcmd{pull} command, and telling it which repository to pull from.

441 \interaction{tour.pull}

442 As you can see from the before-and-after output of \hgcmd{tip}, we

443 have successfully pulled changes into our repository. There remains

444 one step before we can see these changes in the working directory.

445

446 \subsection{Updating the working directory}

447

448 We have so far glossed over the relationship between a repository and

449 its working directory. The \hgcmd{pull} command that we ran in

450 section~\ref{sec:tour:pull} brought changes into the repository, but

451 if we check, there's no sign of those changes in the working

452 directory. This is because \hgcmd{pull} does not (by default) touch

453 the working directory. Instead, we use the \hgcmd{update} command to

454 do this.

455 \interaction{tour.update}

456

457 It might seem a bit strange that \hgcmd{pull} doesn't update the

458 working directory automatically. There's actually a good reason for

459 this: you can use \hgcmd{update} to update the working directory to

460 the state it was in at \emph{any revision} in the history of the

461 repository. If you had the working directory updated to an old

462 revision---to hunt down the origin of a bug, say---and ran a

463 \hgcmd{pull} which automatically updated the working directory to a

464 new revision, you might not be terribly happy.

465

466 However, since pull-then-update is such a common thing to do,

467 Mercurial lets you combine the two by passing the \hgopt{pull}{-u}

468 option to \hgcmd{pull}.

469 \begin{codesample2}

470 hg pull -u

471 \end{codesample2}

472 If you look back at the output of \hgcmd{pull} in

473 section~\ref{sec:tour:pull} when we ran it without \hgopt{pull}{-u},

474 you can see that it printed a helpful reminder that we'd have to take

475 an explicit step to update the working directory:

476 \begin{codesample2}

477 (run 'hg update' to get a working copy)

478 \end{codesample2}

479

480 To find out what revision the working directory is at, use the

481 \hgcmd{parents} command.

482 \interaction{tour.parents}

483 To update the working directory to a particular revision, give a

484 revision number or changeset~ID to the \hgcmd{update} command.

485 \interaction{tour.older}

486 If you omit an explicit revision, \hgcmd{update} will update to the

487 tip revision, as shown by the second call to \hgcmd{update} in the

488 example above.

489

490 \subsection{Pushing changes to another repository}

491

492 Mercurial lets us push changes to another repository, from the

493 repository we're currently visiting. As with the example of

494 \hgcmd{pull} above, we'll create a temporary repository to push our

495 changes into.

496 \interaction{tour.clone-push}

497 The \hgcmd{outgoing} command tells us what changes would be pushed

498 into another repository.

499 \interaction{tour.outgoing}

500 And the \hgcmd{push} command does the actual push.

501 \interaction{tour.push}

502 As with \hgcmd{pull}, the \hgcmd{push} command does not update the

503 working directory in the repository that it's pushing changes into.

504 (Unlike \hgcmd{pull}, \hgcmd{push} does not provide a \texttt{-u}

505 option that updates the other repository's working directory.)

506

507 What happens if we try to pull or push changes and the receiving

508 repository already has those changes? Nothing too exciting.

509 \interaction{tour.push.nothing}

510

511 \subsection{Sharing changes over a network}

512

513 The commands we have covered in the previous few sections are not

514 limited to working with local repositories. Each works in exactly the

515 same fashion over a network connection; simply pass in a URL instead

516 of a local path.

517 \interaction{tour.outgoing.net}

518 In this example, we can see what changes we could push to the remote

519 repository, but the repository is understandably not set up to let

520 anonymous users push to it.

521 \interaction{tour.push.net}

522

523 %%% Local Variables:

524 %%% mode: latex

525 %%% TeX-master: "00book"

526 %%% End: