hgbook

view en/tour-basic.tex @ 96:7d7ddc3a57af

Split tour into two chapters.
author Bryan O'Sullivan <bos@serpentine.com>
date Fri Oct 13 15:34:54 2006 -0700 (2006-10-13)
parents 47ea206351d5
children 659fa1a2c628
line source
1 \chapter{A tour of Mercurial: the basics}
2 \label{chap:tour-basic}
4 \section{Installing Mercurial on your system}
5 \label{sec:tour:install}
7 Prebuilt binary packages of Mercurial are available for every popular
8 operating system. These make it easy to start using Mercurial on your
9 computer immediately.
11 \subsection{Linux}
13 Because each Linux distribution has its own packaging tools, policies,
14 and rate of development, it's difficult to give a comprehensive set of
15 instructions on how to install Mercurial binaries. The version of
16 Mercurial that you will end up with can vary depending on how active
17 the person is who maintains the package for your distribution.
19 To keep things simple, I will focus on installing Mercurial from the
20 command line under the most popular Linux distributions. Most of
21 these distributions provide graphical package managers that will let
22 you install Mercurial with a single click; the package name to look
23 for is \texttt{mercurial}.
25 \begin{itemize}
26 \item[Debian]
27 \begin{codesample4}
28 apt-get install mercurial
29 \end{codesample4}
31 \item[Fedora Core]
32 \begin{codesample4}
33 yum install mercurial
34 \end{codesample4}
36 \item[Gentoo]
37 \begin{codesample4}
38 emerge mercurial
39 \end{codesample4}
41 \item[OpenSUSE]
42 \begin{codesample4}
43 yum install mercurial
44 \end{codesample4}
46 \item[Ubuntu] Ubuntu's Mercurial package is particularly old, and you
47 should not use it. If you know how, you can rebuild and install the
48 Debian package. It's probably easier to build Mercurial from source
49 and simply run that; see section~\ref{sec:srcinstall:unixlike} for
50 details.
51 \end{itemize}
53 \subsection{Mac OS X}
55 Lee Cantey publishes an installer of Mercurial for Mac OS~X at
56 \url{http://mercurial.berkwood.com}. This package works on both
57 Intel-~and Power-based Macs. Before you can use it, you must install
58 a compatible version of Universal MacPython~\cite{web:macpython}. This
59 is easy to do; simply follow the instructions on Lee's site.
61 \subsection{Solaris}
63 XXX.
65 \subsection{Windows}
67 Lee Cantey publishes an installer of Mercurial for Windows at
68 \url{http://mercurial.berkwood.com}. This package has no external
69 dependencies; it ``just works''.
71 \begin{note}
72 The Windows version of Mercurial does not automatically convert line
73 endings between Windows and Unix styles. If you want to share work
74 with Unix users, you must do a little additional configuration
75 work. XXX Flesh this out.
76 \end{note}
78 \section{Getting started}
80 To begin, we'll use the \hgcmd{version} command to find out whether
81 Mercurial is actually installed properly. The actual version
82 information that it prints isn't so important; it's whether it prints
83 anything at all that we care about.
84 \interaction{tour.version}
86 \subsection{Built-in help}
88 Mercurial provides a built-in help system. This invaluable for those
89 times when you find yourself stuck trying to remember how to run a
90 command. If you are completely stuck, simply run \hgcmd{help}; it
91 will print a brief list of commands, along with a description of what
92 each does. If you ask for help on a specific command (as below), it
93 prints more detailed information.
94 \interaction{tour.help}
95 For a more impressive level of detail (which you won't usually need)
96 run \hgcmdargs{help}{\hggopt{-v}}. The \hggopt{-v} option is short
97 for \hggopt{--verbose}, and tells Mercurial to print more information
98 than it usually would.
100 \section{Working with a repository}
102 In Mercurial, everything happens inside a \emph{repository}. The
103 repository for a project contains all of the files that ``belong to''
104 that project, along with a historical record of the project's files.
106 There's nothing particularly magical about a repository; it is simply
107 a directory tree in your filesystem that Mercurial treats as special.
108 You can rename delete a repository any time you like, using either the
109 command line or your file browser.
111 \subsection{Making a local copy of a repository}
113 \emph{Copying} a repository is just a little bit special. While you
114 could use a normal file copying command to make a copy of a
115 repository, it's best to use a built-in command that Mercurial
116 provides. This command is called \hgcmd{clone}, because it creates an
117 identical copy of an existing repository.
118 \interaction{tour.clone}
119 If our clone succeeded, we should now have a local directory called
120 \dirname{hello}. This directory will contain some files.
121 \interaction{tour.ls}
122 These files have the same contents and history in our repository as
123 they do in the repository we cloned.
125 Every Mercurial repository is complete, self-contained, and
126 independent. It contains its own private copy of a project's files
127 and history. A cloned repository remembers the location of the
128 repository it was cloned from, but it does not communicate with that
129 repository, or any other, unless you tell it to.
131 What this means for now is that we're free to experiment with our
132 repository, safe in the knowledge that it's a private ``sandbox'' that
133 won't affect anyone else.
135 \subsection{What's in a repository?}
137 When we take a more detailed look inside a repository, we can see that
138 it contains a directory named \dirname{.hg}. This is where Mercurial
139 keeps all of its metadata for the repository.
140 \interaction{tour.ls-a}
142 The contents of the \dirname{.hg} directory and its subdirectories are
143 private to Mercurial. Every other file and directory in the
144 repository is yours to do with as you please.
146 To introduce a little terminology, the \dirname{.hg} directory is the
147 ``real'' repository, and all of the files and directories that coexist
148 with it are said to live in the \emph{working directory}. An easy way
149 to remember the distinction is that the \emph{repository} contains the
150 \emph{history} of your project, while the \emph{working directory}
151 contains a \emph{snapshot} of your project at a particular point in
152 history.
154 \section{A tour through history}
156 One of the first things we might want to do with a new, unfamiliar
157 repository is understand its history. The \hgcmd{log} command gives
158 us a view of history.
159 \interaction{tour.log}
160 By default, this command prints a brief paragraph of output for each
161 change to the project that was recorded. In Mercurial terminology, we
162 call each of these recorded events a \emph{changeset}, because it can
163 contain a record of changes to several files.
165 The fields in a record of output from \hgcmd{log} are as follows.
166 \begin{itemize}
167 \item[\texttt{changeset}] This field has the format of a number,
168 followed by a colon, followed by a hexadecimal string. These are
169 \emph{identifiers} for the changeset. There are two identifiers
170 because the number is shorter and easier to type than the hex
171 string.
172 \item[\texttt{user}] The identity of the person who created the
173 changeset. This is a free-form field, but it most often contains a
174 person's name and email address.
175 \item[\texttt{date}] The date and time on which the changeset was
176 created, and the timezone in which it was created. (Thef date and
177 time are local to that timezone; they display what time and date it
178 was for the person who created the changeset.)
179 \item[\texttt{summary}] The first line of the text message that the
180 creator of the changeset entered to describe the changeset.
181 \end{itemize}
182 The default output printed by \hgcmd{log} is purely a summary; it is
183 missing a lot of detail.
185 \begin{figure}[ht]
186 \centering
187 \grafix{tour-history}
188 \caption{Graphical history of the \dirname{hello} repository}
189 \label{fig:tour:history}
190 \end{figure}
192 \subsection{Changesets, revisions, and identification}
194 English being a notoriously sloppy language, we have a variety of
195 terms that have the same meaning. If you are talking about Mercurial
196 history with other people, you will find that the word ``changeset''
197 is often compressed to ``change'' or ``cset'', and sometimes a
198 changeset is referred to as a ``revision'' or a ``rev''.
200 While it doesn't matter what \emph{word} you use to refer to the
201 concept of ``a~changeset'', the \emph{identifier} that you use to
202 refer to ``a~\emph{specific} changeset'' is of great importance.
203 Recall that the \texttt{changeset} field in the output from
204 \hgcmd{log} identifies a changeset using both a number and a
205 hexadecimal string. The number is \emph{only valid in that
206 repository}, while the hex string is the \emph{permanent, unchanging
207 identifier} that will always identify that changeset in every copy
208 of the repository.
210 This distinction is important. If you send someone an email talking
211 about ``revision~33'', there's a high likelihood that their
212 revision~33 will \emph{not be the same} as yours. The reason for this
213 is that a revision number depends on the order in which changes
214 arrived in a repository, and there is no guarantee that the same
215 changes will happen in the same order in different repositories.
216 Three changes $a,b,c$ can easily appear in one repository as $0,1,2$,
217 while in another as $1,0,2$.
219 Mercurial uses revision numbers purely as a convenient shorthand. If
220 you need to discuss a changeset with someone, or make a record of a
221 changeset for some other reason (for example, in a bug report), use
222 the hexadecimal identifier.
224 \subsection{Viewing specific revisions}
226 To narrow the output of \hgcmd{log} down to a single revision, use the
227 \hgopt{log}{-r} (or \hgopt{log}{--rev}) option. You can use either a
228 revision number or a long-form changeset identifier, and you can
229 provide as many revisions as you want. \interaction{tour.log-r}
231 If you want to see the history of several revisions without having to
232 list each one, you can use \emph{range notation}; this lets you
233 express the idea ``I want all revisions between $a$ and $b$,
234 inclusive''.
235 \interaction{tour.log.range}
236 Mercurial also honours the order in which you specify revisions, so
237 \hgcmdargs{log}{-r 2:4} prints $2,3,4$ while \hgcmdargs{log}{-r 4:2}
238 prints $4,3,2$.
240 \subsection{More detailed information}
242 While the summary information printed by \hgcmd{log} is useful if you
243 already know what you're looking for, you may need to see a complete
244 description of the change, or a list of the files changed, if you're
245 trying to decide whether a changeset is the one you're looking for.
246 The \hgcmd{log} command's \hggopt{-v} (or \hggopt{--verbose})
247 option gives you this extra detail.
248 \interaction{tour.log-v}
250 If you want to see both the description and content of a change, add
251 the \hgopt{log}{-p} (or \hgopt{log}{--patch}) option. This displays
252 the content of a change as a \emph{unified diff} (if you've never seen
253 a unified diff before, see section~\ref{sec:mq:patch} for an overview).
254 \interaction{tour.log-vp}
256 \section{All about command options}
258 Let's take a brief break from exploring Mercurial commands to discuss
259 a pattern in the way that they work; you may find this useful to keep
260 in mind as we continiue our tour.
262 Mercurial has a consistent and straightforward approach to dealing
263 with the options that you can pass to commands. It follows the
264 conventions for options that are common to modern Linux and Unix
265 systems.
266 \begin{itemize}
267 \item Every option has a long name. For example, as we've already
268 seen, the \hgcmd{log} command accepts a \hgopt{log}{--rev} option.
269 \item Most options have short names, too. Instead of
270 \hgopt{log}{--rev}, we can use \hgopt{log}{-r}. (The reason that
271 some options don't have short names is that the options in question
272 are rarely used.)
273 \item Long options start with two dashes (e.g.~\hgopt{log}{--rev}),
274 while short options start with one (e.g.~\hgopt{log}{-r}).
275 \item Option naming and usage is consistent across commands. For
276 example, every command that lets you specify a changeset~ID or
277 revision number accepts both \hgopt{log}{-r} and \hgopt{log}{--rev}
278 arguments.
279 \end{itemize}
280 In the examples throughout this book, I use short options instead of
281 long. This just reflects my own preference, so don't read anything
282 significant into it.
284 Most commands that print output of some kind will print more output
285 when passed a \hggopt{-v} (or \hggopt{--verbose}) option, and less
286 when passed \hggopt{-q} (or \hggopt{--quiet}).
288 \section{Making and reviewing changes}
290 Now that we have a grasp of viewing history in Mercurial, let's take a
291 look at making some changes and examining them.
293 The first thing we'll do is isolate our experiment in a repository of
294 its own. We use the \hgcmd{clone} command, but we don't need to
295 clone a copy of the remote repository. Since we already have a copy
296 of it locally, we can just clone that instead. This is much faster
297 than cloning over the network, and cloning a local repository uses
298 less disk space in most cases, too.
299 \interaction{tour.reclone}
300 As an aside, it's often good practice to keep a ``pristine'' copy of a
301 remote repository around, which you can then make temporary clones of
302 to create sandboxes for each task you want to work on. This lets you
303 work on multiple tasks in parallel, each isolated from the others
304 until it's complete and you're ready to integrate it back. Because
305 local clones are so cheap, there's almost no overhead to cloning and
306 destroying repositories whenever you want.
308 In our \dirname{my-hello} repository, we have a file
309 \filename{hello.c} that contains the classic ``hello, world'' program.
310 Let's use the ancient and venerable \command{sed} command to edit this
311 file so that it prints a second line of output. (I'm only using
312 \command{sed} to do this because it's easy to write a scripted example
313 this way. Since you're not under the same constraint, you probably
314 won't want to use \command{sed}; simply use your preferred text editor to
315 do the same thing.)
316 \interaction{tour.sed}
318 Mercurial's \hgcmd{status} command will tell us what Mercurial knows
319 about the files in the repository.
320 \interaction{tour.status}
321 The \hgcmd{status} command prints no output for some files, but a line
322 starting with ``\texttt{M}'' for \filename{hello.c}. Unless you tell
323 it to, \hgcmd{status} will not print any output for files that have
324 not been modified.
326 The ``\texttt{M}'' indicates that Mercurial has noticed that we
327 modified \filename{hello.c}. Notice that we didn't need to
328 \emph{inform} Mercurial that we were going to modify the file before
329 we started, or that we had modified the file after we were done; it
330 was able to figure this out itself.
332 It's a little bit helpful to know that we've modified
333 \filename{hello.c}, but we might prefer to know exactly \emph{what}
334 changes we've made to it. To do this, we use the \hgcmd{diff}
335 command.
336 \interaction{tour.diff}
338 \section{Recording changes in a new changeset}
340 We can modify files, build and test our changes, and use
341 \hgcmd{status} and \hgcmd{diff} to review our changes, until we're
342 satisfied with what we've done and arrive at a natural stopping point
343 where we want to record our work in a new changeset.
345 The \hgcmd{commit} command lets us create a new changeset; we'll
346 usually refer to this as ``making a commit'' or ``committing''.
348 \subsection{Writing a commit message}
350 When we commit a change, Mercurial drops us into a text editor, to
351 enter a message that will describe the modifications we've made in
352 this changeset. This is called the \emph{commit message}. It will be
353 a record for readers of what we did and why, and it will be printed by
354 \hgcmd{log} after we've finished committing.
355 \interaction{tour.commit}
357 The editor that the \hgcmd{commit} command drops us into will contain
358 an empty line, followed by a number of lines starting with
359 ``\texttt{HG:}''.
360 \begin{codesample2}
361 \emph{empty line}
362 HG: changed hello.c
363 \end{codesample2}
364 Mercurial ignores the lines that start with ``\texttt{HG:}''; it uses
365 them only to tell us which files it's recording changes to. Modifying
366 or deleting these lines has no effect.
368 \subsection{Writing a good commit message}
370 Since \hgcmd{log} only prints the first line of a commit message by
371 default, it's best to write a commit message whose first line stands
372 alone. Here's a real example of a commit message that \emph{doesn't}
373 follow this guideline, and hence has a summary that is not readable.
374 \begin{codesample2}
375 changeset: 73:584af0e231be
376 user: Censored Person <censored.person@example.org>
377 date: Tue Sep 26 21:37:07 2006 -0700
378 summary: include buildmeister/commondefs. Add an exports and install
379 \end{codesample2}
381 As far as the remainder of the contents of the commit message are
382 concerned, there are no hard-and-fast rules. Mercurial itself doesn't
383 interpret or care about the contents of the commit message, though
384 your project may have policies that dictate a certain kind of
385 formatting.
387 My personal preference is for short, but informative, commit messages
388 that tell me something that I can't figure out with a quick glance at
389 the output of \hgcmdargs{log}{--patch}.
391 \subsection{Aborting a commit}
393 If you decide that you don't want to commit while in the middle of
394 editing a commit message, simply exit from your editor without saving
395 the file that it's editing. This will cause nothing to happen to
396 either the repository or the working directory.
398 If we run the \hgcmd{commit} command without any arguments, it records
399 all of the changes we've made, as reported by \hgcmd{status} and
400 \hgcmd{diff}.
402 \subsection{Admiring our new handywork}
404 Once we've finished the commit, we can use the \hgcmd{tip} command to
405 display the changeset we just created. This command produces output
406 that is identical to \hgcmd{log}, but it only displays the newest
407 revision in the repository.
408 \interaction{tour.tip}
409 We refer to the newest revision in the repository as the tip revision,
410 or simply the tip.
412 \section{Sharing changes}
414 We mentioned earlier that repositories in Mercurial are
415 self-contained. This means that the changeset we just created exists
416 only in our \dirname{my-hello} repository. Let's look at a few ways
417 that we can propagate this change into other repositories.
419 \subsection{Pulling changes from another repository}
420 \label{sec:tour:pull}
422 To get started, let's clone our original \dirname{hello} repository,
423 which does not contain the change we just committed. We'll call our
424 temporary repository \dirname{hello-pull}.
425 \interaction{tour.clone-pull}
427 We'll use the \hgcmd{pull} command to bring changes from
428 \dirname{my-hello} into \dirname{hello-pull}. However, blindly
429 pulling unknown changes into a repository is a somewhat scary
430 prospect. Mercurial provides the \hgcmd{incoming} command to tell us
431 what changes the \hgcmd{pull} command \emph{would} pull into the
432 repository, without actually pulling the changes in.
433 \interaction{tour.incoming}
434 (Of course, someone could cause more changesets to appear in the
435 repository that we ran \hgcmd{incoming} in, before we get a chance to
436 \hgcmd{pull} the changes, so that we could end up pulling changes that we
437 didn't expect.)
439 Bringing changes into a repository is a simple matter of running the
440 \hgcmd{pull} command, and telling it which repository to pull from.
441 \interaction{tour.pull}
442 As you can see from the before-and-after output of \hgcmd{tip}, we
443 have successfully pulled changes into our repository. There remains
444 one step before we can see these changes in the working directory.
446 \subsection{Updating the working directory}
448 We have so far glossed over the relationship between a repository and
449 its working directory. The \hgcmd{pull} command that we ran in
450 section~\ref{sec:tour:pull} brought changes into the repository, but
451 if we check, there's no sign of those changes in the working
452 directory. This is because \hgcmd{pull} does not (by default) touch
453 the working directory. Instead, we use the \hgcmd{update} command to
454 do this.
455 \interaction{tour.update}
457 It might seem a bit strange that \hgcmd{pull} doesn't update the
458 working directory automatically. There's actually a good reason for
459 this: you can use \hgcmd{update} to update the working directory to
460 the state it was in at \emph{any revision} in the history of the
461 repository. If you had the working directory updated to an old
462 revision---to hunt down the origin of a bug, say---and ran a
463 \hgcmd{pull} which automatically updated the working directory to a
464 new revision, you might not be terribly happy.
466 However, since pull-then-update is such a common thing to do,
467 Mercurial lets you combine the two by passing the \hgopt{pull}{-u}
468 option to \hgcmd{pull}.
469 \begin{codesample2}
470 hg pull -u
471 \end{codesample2}
472 If you look back at the output of \hgcmd{pull} in
473 section~\ref{sec:tour:pull} when we ran it without \hgopt{pull}{-u},
474 you can see that it printed a helpful reminder that we'd have to take
475 an explicit step to update the working directory:
476 \begin{codesample2}
477 (run 'hg update' to get a working copy)
478 \end{codesample2}
480 To find out what revision the working directory is at, use the
481 \hgcmd{parents} command.
482 \interaction{tour.parents}
483 To update the working directory to a particular revision, give a
484 revision number or changeset~ID to the \hgcmd{update} command.
485 \interaction{tour.older}
486 If you omit an explicit revision, \hgcmd{update} will update to the
487 tip revision, as shown by the second call to \hgcmd{update} in the
488 example above.
490 \subsection{Pushing changes to another repository}
492 Mercurial lets us push changes to another repository, from the
493 repository we're currently visiting. As with the example of
494 \hgcmd{pull} above, we'll create a temporary repository to push our
495 changes into.
496 \interaction{tour.clone-push}
497 The \hgcmd{outgoing} command tells us what changes would be pushed
498 into another repository.
499 \interaction{tour.outgoing}
500 And the \hgcmd{push} command does the actual push.
501 \interaction{tour.push}
502 As with \hgcmd{pull}, the \hgcmd{push} command does not update the
503 working directory in the repository that it's pushing changes into.
504 (Unlike \hgcmd{pull}, \hgcmd{push} does not provide a \texttt{-u}
505 option that updates the other repository's working directory.)
507 What happens if we try to pull or push changes and the receiving
508 repository already has those changes? Nothing too exciting.
509 \interaction{tour.push.nothing}
511 \subsection{Sharing changes over a network}
513 The commands we have covered in the previous few sections are not
514 limited to working with local repositories. Each works in exactly the
515 same fashion over a network connection; simply pass in a URL instead
516 of a local path.
517 \interaction{tour.outgoing.net}
518 In this example, we can see what changes we could push to the remote
519 repository, but the repository is understandably not set up to let
520 anonymous users push to it.
521 \interaction{tour.push.net}
523 %%% Local Variables:
524 %%% mode: latex
525 %%% TeX-master: "00book"
526 %%% End: