Name: Where GREP Came From - Computerphile
Uploaded: 2021-01-14T10:13:26.000Z
Duration: 10 min 7 s
Description: Learn English expressions while listening to the pronunciation in VoiceTube videos!

I thought today maybe we would talk about 'grep', a well-known command  in the UNIX world. Something that's been around since the early

1970s. What 'grep' lets you do is to search for  patterns of text - arbitrary patterns of text in

one or more files and there could be an unbounded number of

files of input. Or the input could be coming from some other

program, for example as it is if you're using Unix pipelines.

So you take some program and you pipe it into 'grep' and  that way, no matter what the amount of input is, 'grep' can

filter out, or show you, the things that you're interested in.

And that's stuff that you can't do with a text  editor very conveniently - if at all.
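
As a rough illustration of that filtering idea, here is a minimal Python sketch. It is not how 'grep' itself is implemented; the function name `filter_lines` and the sample input are made up for this example. It only shows why the amount of input does not matter: each line is examined and then forgotten.

```python
import re
from typing import Iterable, Iterator


def filter_lines(pattern: str, lines: Iterable[str]) -> Iterator[str]:
    """Yield only the lines that contain a match for the regular expression.

    Hypothetical helper for illustration; it handles one line at a time,
    so the total amount of input never has to fit in memory.
    """
    regex = re.compile(pattern)
    for line in lines:
        if regex.search(line):
            yield line


# Stand-in for the output of some other program arriving through a pipe.
piped_input = iter(["alpha\n", "print this\n", "beta\n", "reprint\n"])
matches = list(filter_lines("print", piped_input))
```

Passing `sys.stdin` in place of `piped_input` would filter a real pipeline, since Python file objects also yield one line at a time.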

One of the issues with 'grep' has always been: [where does the name come from?]

And so I thought, perhaps, I could tell that story, if it would be  of any interest and we'll see where we go from there.

The way it came about - you have to put yourself back in the  early days of computing, before everybody present in this room,

1970-71 -- the very, very, early days of UNIX.

The computer that UNIX ran on was a PDP-11. At that point

it was probably an 11/20. It was a machine that had very  very little computing power. It didn't run very fast.

[It also had very little memory,] maybe 64K bytes - and that's 64 kilobytes, not megabytes.

And very small secondary storage as well, you know  a few megabytes of disk and things like that.

So, very very limited computing resources and that meant  that a lot of the software that was in early days of UNIX

tended to be fairly simple and straightforward.

And, that reflected not only the sort of ... the relative 'wimpiness' of the hardware but also the personal tastes of the people doing the work,

primarily Ken Thompson and Dennis Ritchie.

So one of the prop ... one of the standard programs that people use is the text editor on any system.

The UNIX text editor was called 'ed', and it's not pronounced 'edd'.

At least by those in the know, it's pronounced 'ee dee',

and I think it was a, basically, stripped-down version of an

editor called QED, which Ken had worked with  and done a lot of work on earlier.

[...] editor, and the thing that you have to remember is that, in those days, in addition [...]

you didn't have actual video display terminals -

not of the sort that we're used to today, or  even 10 or 20 years ago.

But in fact all the computing, all of your editing and so on, was done on paper [...]

[...] you can see paper! This meant that there were a lot of things that tried to minimize the use of paper.

It also meant that editors worked one line at  a time, or multiple lines at a time,

but there was no cursor addressing, so you  couldn't move around within a line.

And so the 'ed' text editor reflected that kind of thing.

Maybe what I should do is just a quick look at what 'ed' looked like. So the commands for 'ed' were single-letter commands.

So, for example, there was a command called 'p',

which stood for 'print'; there was a command called 'd', which would delete a line.

There was a command called 's', which took a little bit ... which  said 'substitute' so you could change this

y'know, 'ABC' into 'DEF', or something like that.

There was an 'append' command that simply said 'add some more text' and  you could add a bunch of lines and then terminate it with something.

[There was a 'read' command] so that you could read information from a file, and there was a 'write' command [so]

that you could put it back in a file. And a handful of other things like that. So that was the essence of what it did.
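
To make those single-letter commands concrete, here is a toy Python sketch of a line buffer with 'p', 'd', 's' and 'a'. It is only loosely modelled on the description above and is not the real 'ed': the class name `TinyBuffer` is invented for this example, and 's' here replaces literal text rather than a regular expression.

```python
class TinyBuffer:
    """A toy in-memory line buffer; every command acts on the current line."""

    def __init__(self, lines):
        self.lines = list(lines)
        self.current = len(self.lines)  # 1-based number of the current line

    def p(self):
        """'print': return the current line."""
        return self.lines[self.current - 1]

    def d(self):
        """'delete': remove the current line; the following line becomes current."""
        del self.lines[self.current - 1]
        self.current = min(self.current, len(self.lines))

    def s(self, old, new):
        """'substitute': replace the first occurrence of old on the current line."""
        idx = self.current - 1
        self.lines[idx] = self.lines[idx].replace(old, new, 1)

    def a(self, new_lines):
        """'append': add lines after the current line; the last one added becomes current."""
        new_lines = list(new_lines)
        self.lines[self.current:self.current] = new_lines
        self.current += len(new_lines)


buf = TinyBuffer(["ABC one", "two"])
buf.current = 1
buf.s("ABC", "DEF")        # line 1 becomes "DEF one"
buf.a(["inserted"])        # new line 2, which is now the current line
```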

One of the things that 'ed' did very nicely was that,  OK, these apply by default to the current line

But what do you do when you want to have more  specification of what lines you're operating on?

And so you could say things like 'line 1 to line 10 print'

So, '1,10p' would print the first 10 lines.

But suppose you wanted to print all of the lines in the file?

So there was a shorthand called '$'. So, I could say '1,$p'  and that would print all of the lines in the file.

Or I could say: "Gee! I wonder ... I just want to see the last line". So I could say '$p' and that would

give me that. I could even elide the 'p', but that's good enough.

Or I could delete the last line by saying '$d'. Or I could  delete the first line by saying '1d'.

That is sort of the line addressing. So far not very complicated.
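
The addressing rules just described can be sketched in a few lines of Python. This is an illustration under stated assumptions, not the real 'ed' parser: the helper names `resolve` and `run` are made up, and only the forms mentioned in the talk are handled (a line number, '$', a 'start,end' pair, or no address at all, meaning the current line).

```python
from typing import List, Tuple


def resolve(address: str, current: int, last: int) -> Tuple[int, int]:
    """Turn '', '7', '$', '1,10' or '1,$' into a 1-based inclusive (start, end)."""
    def one(token: str) -> int:
        return last if token == "$" else int(token)  # '$' means the last line

    if address == "":
        return current, current                      # default: the current line
    parts = address.split(",")
    if len(parts) > 2:
        raise ValueError("too many addresses")
    start, end = one(parts[0]), one(parts[-1])
    if not 1 <= start <= end <= last:
        raise ValueError("address out of range")
    return start, end


def run(command: str, lines: List[str], current: int) -> List[str]:
    """Apply 'p' or 'd' with an optional address; return the printed lines."""
    address, op = command[:-1], command[-1]
    start, end = resolve(address, current, len(lines))
    if op == "p":
        return lines[start - 1:end]
    if op == "d":
        del lines[start - 1:end]
        return []
    raise ValueError("unknown command")


buffer_lines = ["line %d" % i for i in range(1, 13)]
first_ten = run("1,10p", buffer_lines, current=12)
last_line = run("$p", buffer_lines, current=1)
```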

The thing that 'ed' added to all of that, and this is definitely Ken's influence, was the idea of regular expressions.

[A regular expression is a pattern of] text - it's a way of specifying patterns of text.

They could be literal texts like the word 'print' or they could be  something more complicated, like things that start with

'Prin' but might go on to 'Print' or 'Princeton' or 'Princess', or whatever. That kind of thing.

And the way that regular expressions were written in the 'ed' text  editor was you said '/' and

then you wrote the characters of the regular expression.  So, I could say '/print/'

and that would be something that would match the next line, in what I was working on, that contained the word 'print'
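
A '/print/' style search can be sketched roughly as follows. The talk only says "the next line"; 'ed' is documented (for example in the POSIX specification) as starting with the line after the current one and wrapping around at the end of the buffer, and this sketch assumes that behaviour. The function name `search_forward` is made up, and Python's `re` syntax stands in for the editor's own regular expressions.

```python
import re
from typing import List, Optional


def search_forward(pattern: str, lines: List[str], current: int) -> Optional[int]:
    """Return the 1-based number of the next line matching pattern, or None."""
    regex = re.compile(pattern)
    n = len(lines)
    for offset in range(1, n + 1):
        # Step forward from the current line, wrapping past the last line;
        # the current line itself is examined last.
        candidate = (current - 1 + offset) % n + 1
        if regex.search(lines[candidate - 1]):
            return candidate
    return None


text = ["int main", "print a", "x = 1", "print b"]
found = search_forward("print", text, current=1)
```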

So the regular expressions in the 'ed' editor were somewhat different - a little more

sophisticated, and complicated, than the regular expressions  that you might find in shell wildcards,

where, for example, a star means 'anything at all'. So,

the same idea of patterns of text - a slightly different

specification - a different way of writing patterns but suitable for  text editing. And so, then, I could say things like "I want to find the next

occurrence of the word 'print' in my file". And then there I would be.

And on, and on, and on, like that. OK, so that's the 'ed' text editor.
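
The difference between shell wildcards and regular expressions mentioned above can be seen in a small Python comparison. This is a sketch using Python's `fnmatch` and `re` modules as stand-ins for the shell and for the editor's regular expressions; the word list is made up for the example.

```python
import fnmatch
import re

words = ["Print", "Princeton", "Princess", "Reprint"]

# Shell-style wildcard: '*' on its own stands for any run of characters.
glob_hits = [w for w in words if fnmatch.fnmatchcase(w, "Prin*")]

# Regular expression: '*' means "zero or more of the preceding item",
# so "any run of characters" has to be spelled '.*'.
regex_hits = [w for w in words if re.fullmatch(r"Prin.*", w)]

# The same characters 'Prin*' read as a regular expression mean
# 'Pri' followed by zero or more 'n', which is a different pattern.
star_as_regex = [w for w in ["Pri", "Prinnn", "Princeton"] if re.fullmatch(r"Prin*", w)]
```

Both `glob_hits` and `regex_hits` select the three words starting with 'Prin', while `star_as_regex` keeps only 'Pri' and 'Prinnn'.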

We are a long way away from 'grep' at this point.  So what's 'grep' all about?

Well, it turns out that at the time that this  was going on, 'ed' was the standard text editor.

But, as I said, the machines you're working on are very very wimpy.

Not much computing capacity in a lot of ways.

One of the limitations was that you couldn't edit a very big file,

because there wasn't enough memory and 'ed' worked entirely within memory and

so you were stuck. One of my colleagues at the time,  Lee McMahon, was very interested in doing text

analysis. The sort of thing that we would call today, [...]

And so what Lee wanted to do ... he had been studying

something that, at the time, was the very  interesting question of who were the authors of

some fundamental American documents called the Federalist Papers.  The Federalist Papers were written by,

variously, James Madison and Alexander Hamilton and John Jay in

1787 and 88, if I recall correctly. There were 85 of these documents.

But they were published anonymously under the name Publius. And so we had no idea, in theory, who wrote them.

And so there's been a lot of scholarship trying  to figure out for sure.

It's well known who wrote some of them and others are still, I think, a

little uncertain and so Lee was interested in seeing  whether you could actually,

figure out who wrote these things. So that's fine. But it turns out that these 85 documents were in total just over a megabyte

- I mean down in the noise by today's standards - [and they] wouldn't fit. He couldn't edit them all in 'ed'.

So one day he said: "I just want to go through and find all the  occurrences of 'something' in the Federalist Papers
