Placeholder Image

字幕表 動画を再生する

  • [MUSIC PLAYING]

  • DAVID MALAN: There's any number of languages in which we can communicate,

  • English being just one of them.

  • But computers, of course, only understand binary zeros and ones.

  • And so somehow we have to communicate our thoughts-- ultimately in binary--

  • in order to solve some problem with a computer.

  • But certainly, we don't want to program computers by writing zeros and ones

  • and memorizing the patterns that they'll understand,

  • so we do somehow need to adopt a process by which

  • we can express our thoughts and the solutions to the problems

  • that we have in mind but in such a way that the computer can understand them.

  • Now it turns out it's quite meaningful when

  • you see something like Intel inside or AMD

  • or any number of other computer manufacturers

  • that make what are called CPUs, Central Processing Units, which

  • you can think of as the brains of sorts inside of your computer.

  • Well, turns out that Intel and AMD and other companies

  • have decided in advance what patterns of bits--

  • zeros and ones-- that their CPUs understand.

  • A certain pattern of zeros and ones might represent addition.

  • Another pattern of zeros and ones might represent subtraction or multiplication

  • or division or the act of moving information around in memory

  • or saving information from memory.

  • And so those patterns are very much computer or CPU specific.

  • And frankly, I'd like to be able to write software and write

  • code that can run on your computer and my computer

  • and some other manufacturer's computer without really having

  • to know all about those zeros and ones.

  • So there too it would be nice if there's a process, a workflow, a tool

  • chain via which we can communicate our thoughts in a fairly accessible way

  • but have them ultimately translated into those zeros and ones.

  • So what is it that Intel inside really means?

  • What is it that a CPU actually understands?

  • Well, it's what's called machine code, zeros and ones that ultimately dictate

  • what the computer should do-- add, subtract, multiply, or something else

  • altogether.

  • That machine code literally might look something like this.

  • In fact, let me give you just a moment and from all of these zeros and ones,

  • can you glean perhaps what this very program would do if run on a computer?

  • No?

  • Well, odds are you couldn't imagine what this would do because even

  • I can't read these zeros and ones.

  • But if it turns out you fed these very zeros and ones to a computer,

  • it would print out on the screen quite simply

  • "hello world," which is sort of the canonical phrase, one

  • of the first phrases ever printed on a computer screen back in the day

  • when computer languages were first being invented.

  • Now you, of course, would never know that, should never know that,

  • nor should even the most sophisticated of programmers

  • because this is just too low of a level to communicate one's thoughts in.

  • Far more compelling would be to operate at a level closer to English

  • or whatever your spoken language might be.

  • And so quickly, when humans invented computers decades ago,

  • did we decide we need something different from machine code.

  • We need something at a higher level of abstraction,

  • if you will, something that's more familiar to us

  • but that's close enough that the computer can somehow figure out

  • what to do.

  • And so thus was born assembly code.

  • Assembly code is an example more generally

  • of what's called source code, which is typically

  • English-like syntax more familiar to us humans that can somehow be translated

  • eventually down to machine code.

  • And assembly code, which is one of the earliest

  • incarnations of this general idea, looked a little something like this.

  • Now it too looks pretty cryptic, though hopefully

  • not quite as cryptic as just seemingly random patterns of zeros and ones

  • because there is some organization to this text here.

  • It's not quite English-like I would say, but there's

  • some familiar sequences of characters that I can perhaps

  • ascribe some meaning to.

  • I see words that look a little familiar-- push queue or at least push,

  • move queue, or perhaps move.

  • Sub-- maybe that means subtract or call, as in to call a function

  • or a procedure, xor and add and pop and others--

  • these seem to be reminiscent of English words.

  • And so while cryptic--

  • and I would surely need a manual in order to figure out what these mean--

  • this is a standardization of how you might communicate instructions

  • to a computer.

  • Indeed, all of those keywords push queue, move queue, sub queue and so

  • forth are literally called instructions, and those

  • are the names given to the instructions, the commands that Intel and AMD have

  • decided that their brains, their CPUs shall understand.

  • To the right of these instructions is some cryptic looking syntax now--

  • dollar signs and percent signs, commas, and others.

  • Well, those are used to communicate what are called registers.

  • It turns out that the smallest unit of useful memory

  • typically inside of a computer--

  • in particular inside of a CPU--

  • is what's called a register.

  • A register might be eight bits back in the day, 32 bits

  • more moderately, or even 64 bits.

  • And that is really the smallest piece of information

  • that you can do some operation on, the smallest unit of information

  • that you can add to something else or subtract from something else and so

  • forth.

  • So when a CPU is doing its arithmetic-- addition, subtraction, multiplication,

  • division, and so forth--

  • it's operating on pretty small values.

  • They might be big numbers, but they only take up maybe 32 or 64 bits.

  • And those registers, those chunks of memory have names.

  • The names to be fair are cryptic, but they're expressed

  • in the same language, assembly code.

  • So that you're telling the computer in this language what should

  • you move to where and what should you add to what.

  • And so once you acquire a taste, if you will, for this language,

  • does all of this begin to make more sense.

  • And frankly, if I really scour it, aha, down here at the bottom,

  • I do see explicit mention of that phrase hello world.

  • And these other lines simply call in to action the printing of that phrase

  • on the screen.

  • But frankly, this doesn't look all that compelling still.

  • It's certainly better than zeros and ones,

  • but assembly code is generally considered

  • to be fairly low level, not as low level as zeros and ones and not as low

  • level as electricity from the wall.

  • But it's still low level enough that it's not really

  • that pleasant to program in.

  • Now back in the day, decades ago, this was all you had at your disposal.

  • And so surely, this was better than nothing else.

  • And in fact, some of the earliest games and some of the earliest software

  • were written in assembly language.

  • So it truly was experts back in the day writing frequently

  • in this low language, and you might still use it today

  • for the smallest of details.

  • But on top of assembly language have evolved more modern forms

  • of source code-- newer languages with easier to understand syntax

  • and more and more features.

  • And one of the first successors to something like assembly code

  • was a language called C, quite simply.

  • C looks like this.

  • Now I dare say this too remains fairly cryptic,

  • but I feel like we're walking up a set of stairs

  • here where things are finally starting to look a little more

  • familiar and a little more comfortable, even though there might still

  • be some distractions of syntax.

  • These angled braces and these curly braces

  • and quotes and parentheses and a semicolon--

  • all of that you might get to in an actual course on programming itself.

  • But here we use this as demonstrative of a fairly more English-like syntax

  • with which you can express the same program.

  • Printing hello world in a language called

  • C can be implemented with precisely this code.

  • But frankly, we're starting to stray pretty far from that low level language

  • that computers ultimately understand and need to accept as their input binary.

  • So how do we get from this higher level language, so to speak,

  • called C down to those zeros and ones?

  • Well, frankly, the process by which this happens

  • tends to make an intermediate stop in what's called that assembly language.

  • So a human might write code like this, like I did here

  • in C. You might then use a program that converts the C code to assembly code

  • and then another program that converts that assembly code down

  • to those zeros and ones.

  • And frankly, you could probably use a tool

  • that does both of those steps at once so that it

  • creates the illusion of going directly from this to so-called machine code.

  • Those zeros and ones.

  • So let's take a moment and actually do that on my computer here.

  • This is a process that you can do on a Mac or PC

  • running Mac OS, Windows, Linux, or any number of operating systems.

  • I happen to be doing it on a Mac here.

  • And I'm going to use a fairly common program these days called a text

  • editor.

  • This is a very lightweight version of a word processor.

  • It doesn't have bold facing and underline and italics.

  • It really just allows you to type text, but it

  • does tend to colorize it for you to draw your attention

  • to the disparate parts of a program.

  • And I'm also going to open up what's called a terminal window.

  • A terminal window is a keyboard-only interface to what your computer can do.

  • So while I still have access to my mouse,

  • it's not going to be all that useful for this environment

  • because anytime I want to run a program or execute a command or make my Mac

  • do something, I'm going to have to do it from my keyboard alone.

  • Let's take a look.

  • Now here I am in front of my text editor,

  • and I'm going to go ahead and create a file called sayhello.c,

  • dot c being a conventional file name to indicate to the computer

  • that the code I am about to write is going

  • to be implemented in that language called C. Now

  • at the top of this program is where I'm going to write my code,

  • and I'm simply going to transcribe what we just saw.

  • Include standard IO dot h int main void open curly brace

  • followed by a closing curly brace ultimately

  • and then printf quote unquote hello comma world backslash n, finally,

  • a semicolon.

  • So herein I've written my source code.

  • And indeed, it's been colorized by the text editor

  • simply to draw my attention to disparate parts of this program

  • and were we to dive deeper into C, in particular,

  • we'd see what each of these different symbols mean.

  • But for now, let me propose that we only care

  • about what this program is meant to do, which

  • is to print ultimately hello world.

  • But all I've done is write a program in C.

  • I somehow have to get it to its form of zeros and ones

  • that the computer ultimately understands.

  • So it's not sufficient just to save the file because, indeed,

  • all that's been saved are these letters and symbols in some file called

  • hello.c.

  • I somehow have to convert that file to zeros and ones.

  • Well, it turns out that there exists tools called compilers.

  • A compiler is simply a piece of software--

  • written presumably by someone else--

  • that knows how to understand C, perhaps knows about assembly code,

  • but definitely knows about those patterns and ones

  • that Intel and AMD ultimately require that I output in order

  • to get them to execute commands.

  • So how do I go about compiling hello.c?

  • Well, it turns out installed it on my computer--

  • and perhaps even yours-- is a program called CC, the C compiler, if you

  • will-- compile simply referring to this process of translating

  • one language to another.

  • And so I'm going to go down here to the lower portion of my screen

  • wherein I have a prompt, dollar sign that for whatever

  • historical reason simply represents a prompt in a terminal window.

  • And it's in here that I can type these textural commands,

  • cc -o hello space hello.c--

  • a fairly cryptic incantation of commands.

  • But ultimately, this has created a new file

  • on my computer called quite simply hello with no file extension.

  • Now on a typical Mac or PC, when you want to run a program,

  • you would typically just double click its icon, and it would be loaded up.

  • And you would see its ultimate behavior.

  • But here in this command line interface, so to speak,

  • wherein I can only type commands textually,

  • I have to tell the computer to run this program only via my keyboard.

  • And the convention via which you can do this

  • is quite simply to say dot slash hello where dot refers to the current folder

  • or directory in which this file is.

  • The slash just separates from its name, and hello, of course,

  • is the name of the program I've written.

  • And here we go.

  • With the stroke of enter, I now see hello world.

  • And thus was born my very first program in C.

  • Of course, it took me quite a while to get to this point,

  • and I didn't necessarily even understand all of those lines of code

  • along the way.

  • But I did understand that my goal at hand and the problem to be solved

  • was to print quite simply hello.

  • But there's a bit of overhead, of course,

  • to a language like C wherein there's not only the syntactic complexity of it,

  • but that frankly gets much more familiar over time.

  • There's also this additional step, this middleman,

  • a compiler that has to exist and somehow translate your source

  • code to machine code.

  • In fact, what we've effectively done just now is this.

  • If up here is my so-called source code stored in any file,

  • for instance, hello.c, and I want to convert it ultimately

  • to so-called machine code, the zeros and ones that my computer understands,

  • I somehow have to get from input to output.

  • And the middleman here is again this tool

  • called compiler in the context of my having written this program call in C.

  • I used a program, a compiler, called CC, but any number of other options exist.

  • You might have heard of Visual Studio perhaps or Eclipse or yet others still.

  • This middleman simply takes as input that source code in C

  • and produces as its output that machine code that the computer expects.

  • And so when I type that Command cc -o hello hello.c,

  • it ultimately was telling my computer take as input hello.c,

  • produce as output a new file called hello, inside

  • of which are those zeros and ones.

  • Now not all languages operate like this.

  • It turns out that more modern languages skip that step of compilation

  • altogether or at least hide that detail from the user

  • so that he or she doesn't necessarily need to know how to compile their code.

  • It's handled more automatically.

  • Now some languages instead use not a compiler but an interpreter.

  • Whereas a compiler takes as input one language like C and produces as output

  • another language like machine code, an interpreter

  • instead takes as input some source code and then runs it

  • or interprets it line by line top to bottom, left to right.

  • And whenever it sees an instruction like print, it thinks to itself,

  • oh, I know how to print something on the screen,

  • and it goes and does it on behalf of that source code.

  • It does not, strictly speaking, convert those instructions

  • instead to zeros and one.

  • It is instead the interpreter itself which

  • is just a program that itself is implemented with zeros and ones

  • that the CPU understands.

  • And those zeros and ones collectively know

  • how to recognize keywords or functions in that source code language

  • it takes as input in order to execute it on the program's behalf.

  • So what is an example of an interpreted language?

  • Well, among the most popular ones today is that called Python.

  • Python is especially popular in the world of data science and web

  • programming and in command line applications

  • ones written at the so-called terminal window via which I can solve problems.

  • And so Python is notable too for its relative simplicity-- certainly vis

  • a vie something like c.

  • In fact, in order to implement the equivalent program in Python

  • that I just implemented in C, it suffices to write just this.

  • Say what you mean and little more.

  • There's less syntax here.

  • There's fewer keywords that are unfamiliar.

  • It's just instead the verb or function print followed by hello world--

  • no semicolon, no curly braces, fewer symbols altogether.

  • But how do I go about running a program in Python?

  • Well, it turns out that typically Python is interpreted, not compiled.

  • So I'm not going to run it through a compiler per se,

  • but I am instead going to interpret it line by line-- albeit just one

  • line with this particular program.

  • So let me go back into my text editor and terminal window

  • and this time create a file called hello.py-- dot

  • py being the conventional file extension for any program written in Python.

  • And in hello.py, I am now going to write that one line

  • program print open parenthesis quote unquote hello world.

  • How do I interpret this file called hello.py?

  • Well, it turns out I run a program that itself is

  • called Python, which is my interpreter.

  • And so I run in my terminal window Python space hello.py,

  • and the output is now the same.

  • So what has just happened?

  • Albeit just one line, what that program called Python has done

  • is open up this file called hello, read it top to bottom,

  • left to right, albeit quite quickly, recognized

  • that it knows this keyword or function called print and therefore knew

  • what to do next.

  • It went ahead and printed hello world on the screen and then automatically quit.

  • So this seems like a nice thing.

  • No longer do I have to remember and take the time to compile my code,

  • but surely, there must be some price.

  • And indeed, one of the implications of saving that step no longer having

  • to compile your code but instead just jumping right

  • to its execution or interpretation is that you pay potentially

  • a performance penalty.

  • You certainly can't quite see it in a program as short as hello world.

  • But if you were to write a program with hundreds or thousands

  • or millions of lines, the overhead required

  • to read that file top to bottom, left to right,

  • and to figure out based on the instructions

  • therein what it is the programmer intended actually

  • does take non-zero amount of time.

  • And it can surely add up for the most computationally complex of problems--

  • anything involving an analysis, anything involving loops or cycles,

  • you can certainly begin to feel its effects.

  • But that's OK because we humans have been fairly creative over time.

  • And as we've invented more and more programming languages,

  • we have fortunately also invented more and more solutions

  • to problems like these.

  • And so it turns out that even though this all happened quite quickly, when

  • I ran this interpreter called Python, odds are if my computer were smart,

  • it was probably doing me a favor underneath the hood

  • without my even knowing.

  • And in fact, what Python and some other interpreters

  • do is actually compile your code for you,

  • save the results in a temporary file that you yourself might not even see,

  • and the next time I run this program, especially if it's large and complex,

  • Python will skip this step of reinterpreting the file again and again

  • and instead look at that precompiled version of my same program--

  • therefore, saving some time but achieving the same results.

  • Only if I go back and change my code and make changes to my program

  • does Python need to regenerate that cached version of code, if you will,

  • in order to reuse that again and again.

  • And this intermediately cached code is generally called byte code.

  • It's not quite zeros and ones, but it's closer to it than Python itself.

  • And so for this same program in Python, were my computer

  • to actually compile it for me, what I would actually see is code like this.

  • Much like assembly code is it fairly cryptic, but at least

  • in there is some familiar phrase hello world as well

  • as the function I ultimately called, which is that known as print.

  • Now Python and C are not the only languages out there.

  • In fact, there are dozens in vogue these days.

  • And there are hundreds-- if not thousands--

  • that humans have created over time.

  • For instance, depicted here is perhaps one with which you

  • yourself are familiar or at least heard of.

  • It's called Java, and it happens to be an object oriented programming

  • language in a language, which means it has features

  • beyond those earliest of ones like C. Here, though,

  • is perhaps the simplest way via which you can implement that same program

  • hello world, but it-- not unlike C-- has a bit

  • of overhead, a number of samples and words that at first glance

  • certainly are not obvious.

  • But ultimately, that's all this program does.

  • But Java is distinct in that it took a different approach to another problem

  • that we've not yet tripped over.

  • I've been running and running this program thus far in my Mac,

  • and I compiled it particularly for this Mac on an Intel CPU.

  • But it certainly stands to reason that you or someone else

  • might not have the same computer or operating system as I,

  • and it would seem to be quite the burden on the programmer

  • if they have to compile their code in a different way for you and for me

  • and for everyone else.

  • And so it turns out that this cost of doing business, if you will,

  • shipping different shrink wrapped boxes back in the day of the same program

  • for different computers and OSes is ultimately

  • solved by way of a virtual machine.

  • A virtual machine, as the name implies, is not

  • a physical machine but a virtual one implemented, as they say in software--

  • software that humans have written that mimics the behavior

  • of a virtual imaginary machine.

  • And then companies like Sun and others have implemented support

  • for that virtual machine for Macs and for PCs

  • and for multiple operating systems.

  • And so Java subscribes to the monitor of write once run anywhere.

  • You needn't compile it again and again for different platforms,

  • rather you can install on each platform its own virtual machine

  • and run the exact same code.

  • So it's simply a different approach to an otherwise omnipresent problem,

  • but Java falls into a class of languages that uses that particular technique.

  • And what other languages are out there?

  • Well, very popular these days on both servers and clients is a language

  • called JavaScript, such as that pictured here.

  • Here is yet another language-- this one called

  • Ruby-- that replaces the word print with just put,

  • but it too is more syntactically simpler, much like Python itself is.

  • On the other hand, here is C++, incredibly common still,

  • especially on PCs, along with other languages as well.

  • But in C++, you see code reminiscent of C itself, also conventionally compiled.

  • Now if you'd like to see any number of overwhelming examples of how you might

  • quite simply in hundreds of languages say hello world,

  • take a look at this URL here.

  • And in fact, there's so many other languages

  • that are popular and powerful, sometimes in different ways.

  • In fact, it's not just the case that you use one particular language

  • for one particular job.

  • There are many tools that you might bring to bear

  • on the exact same problem at hand.

  • In fact, the reason that so many languages exist

  • is that over time, we humans have perhaps rightly or arrogantly

  • decided that we can do better than languages past.

  • And so humans invent new languages that have other features,

  • different approaches, and of course, reasonable people can disagree.

  • And so you have some languages that can achieve the very same task.

  • They just do it differently.

  • The text looks a little different.

  • The features are a little bit different.

  • And so it's up to the programmer as part of the design process

  • to decide what tool is best for the job, not unlike a physical tool

  • that you might have in a home tool chest.

  • Among the most popular these days perhaps are these here Bash and C

  • and C++ and C#, Closure and Erlang and F# and Go, Haskell, Java, JavaScript,

  • Objective OCaml and then PHP, Python, R, Ruby, Scala, Scheme, SQL, Swift,

  • and so many more.

  • In fact, if you'd like to see a nearly exhaustive list of all

  • of the language humans have invented, take a look at Wikipedia

  • here, which goes into so much more detail.

  • So suffice it to say that computers can print hello world,

  • but they can do so much more as well.

  • So what are the basic building blocks that can be found in languages like C,

  • like Python, Java, C++, and any number of others?

  • Well, let's return to the pseudocode with which we began to program, albeit

  • verbally, some time ago.

  • Here we had the algorithm via which to find Mike Smith in a phone book

  • and to search more generally.

  • Recall, though, that laced throughout this pseudocode or program

  • really were a few constructs that were semantically distinct from each other.

  • Let me go ahead and highlight in yellow some of the verbs that we saw last.

  • Pickup and open to and look at call, open and open and quit--

  • all of these were calls to actions, verbs or functions,

  • as we described them previously.

  • Well, it turns out that in C and in Python and other languages as well,

  • we have not necessarily these same words but these same actions.

  • For instance, we've seen in two languages

  • already that you can print hello world on the screen.

  • That function or verb in C happens to be called printf.

  • In Python, it happens to be called more specifically print.

  • And so those are examples of functions in those languages as well.

  • What else did we see last time?

  • Well, we had if and else if and else if and else,

  • and we describe these as conditions or branches

  • via which you can make decisions and either go left or right down

  • the fork in the road.

  • Well, in Python in C and in other languages

  • too do we have those same features as well.

  • We then looked at Boolean expressions, questions

  • that you can ask to which there are yes and no or true and false answers.

  • In Python and C, we're going to see those as well.

  • And lastly, we saw loops-- go back to step two, go back to step two--

  • a programming construct via which you can do something again and again.

  • Now here we did this in order to keep looking for Mike.

  • But in the real world, might you be searching for data in a large database

  • or an Excel file, a CSV file, Comma Separated Values.

  • And so you might want to have some programming code that

  • opens that file and then iterate or loops over one line

  • after another doing some calculation or analysis,

  • loops enable us to do exactly that.

  • So let's go ahead now and see if we can see

  • these constructs in a common modern programming language.

  • We'll use Python if only because it's incredibly commonly used at the command

  • line within a terminal window.

  • It's incredibly commonly used nowadays for web programming.

  • And it's ultimately incredibly useful in the context of data analysis and data

  • science, more generally.

  • So let's begin where we left off recreating a program in Python that

  • quite simply says hello world.

  • To do that in my text editor here, I've created

  • a file now called hello0.py to suggest that we're going to do this

  • a few times, starting at zero, in order to iteratively improve on this program.

  • Well, herein I'm going to do print quote unquote hello world, saving my file.

  • No need to compile it but I'm now going to go ahead

  • and type Python space hello0.py.

  • And there we have hello world.

  • Now this program, of course, is by definition not terribly dynamic.

  • No matter how many times I run this program,

  • it's going to print hello world, hello world again and again.

  • Suppose that I instead wanted to get user inputs.

  • Well, it turns out that Python--

  • like other languages-- has a function via which you can do exactly that.

  • And a function, of course, takes optionally inputs

  • and optionally produces some output.

  • Now in Python, perhaps the easiest function with which to get user's input

  • is called quite simply input.

  • So let's see how we might use that.

  • Let me go ahead and create a new file called hello1.py.

  • And in this file am I first going to call that function called input,

  • and input is called exactly that.

  • But it takes as input some input, which is to say the string or the phrase,

  • the sentence via which you want to prompt the human for his or her input.

  • So for instance, I'll go ahead here and ask

  • what is your name question mark space.

  • And now this function is going to return to me a value, so to speak.

  • I don't know how input is implemented.

  • Someone else yesteryear implement this for me.

  • But I do know per its documentation that when I call it,

  • it's going to do the equivalent of handing me back a slip of paper

  • on which is written the user's input from his or her keyboard.

  • But for me to make use of this input, I need to store it somewhere.

  • And in a programming language, much like in math,

  • you have access to what are called variables.

  • Now a mathematician might call their variables x or y or z.

  • And in programming could we also call our variables

  • the same but more conventional is ways to use

  • names that are a little more descriptive-- actual words that

  • describe their contents and the means by which I can store the return value,

  • so to speak.

  • What is handed back from a function like input is in a variable

  • that I'll call perhaps aptly name.

  • And in order to store the return value of input into a variable called name,

  • it suffices simply to use a single equal sign.

  • And this equal sign is not to be confused with the equal sign

  • that you and I know in math.

  • After all, in math, using an equal sign would

  • apply that what is on the left equals what is on the right.

  • And ultimately, that's our goal.

  • But initially, we only know the value on the right--

  • the so-called return value from input that's handed back to me, so to speak,

  • from the function.

  • So in fact, with the equal sign in many programming languages is called--

  • Python among them-- is the so-called assignment operator.

  • And so you consider what's happening here as being this.

  • On the right-hand side is a function.

  • It's handing back some value, the user's name.

  • The equal sign implies copy that value from right

  • to left into whatever is on the left.

  • And so if on the left I have a variable called name,

  • that variable-- much like x or y or z-- is

  • going to store the user's input ultimately,

  • so I can use it subsequently in some other line of code.

  • Now what might that subsequent line of code be?

  • Well, suppose I'd like to print not just hello world but hello, David,

  • or hello, someone else.

  • Well, now I can simply structure the second line is taking as input

  • the contents of that variable.

  • So I might go ahead here and say print quote unquote hello comma,

  • and I now somehow need to append to that shorter phrase

  • the name that has actually been typed in.

  • And there's a number of ways we can do this.

  • For instance, we might do it as follows.

  • It turns out in Python that you can use plus not

  • only to add two numbers together but also, in some sense,

  • to add two words or two strings as they're called in a language.

  • And if I do plus in between hello and that variable name,

  • the effect is going to be to join or to contaminate those two strings of text

  • together so that would ultimately is printed

  • is hopefully hello comma David or hello comma yourself.

  • So let me go ahead and save this file and in my terminal window now run

  • Python of hello.py, I'll input my name David.

  • And now notice my input is separated by one space from that question mark

  • because on line one did I preemptively include

  • that space between those double quotes.

  • Now I'm going to go ahead and hit enter, the effect

  • of which is to have input return that value, storing it

  • in the variable called name, and then in line two, just go ahead

  • and print hello comma followed by name.

  • Now just to be clear that this is not hardcoded somewhere else.

  • Let me go ahead and run it one more time.

  • Python hello1.py.

  • And let's go ahead and type your name and there too

  • do we see hello your name.

  • So the program is now dynamic, and I've outputted dynamically

  • whatever it is the human is typed in.

  • But it turns out there's any number of other ways we can do this.

  • And so just so that we've seen a few different ways,

  • let me go ahead and create a new file called hello2.py

  • wherein we have almost the same code, but we're displaying your name

  • or mine a little bit differently.

  • As before, I'll go ahead and define my variable called name, assign to it

  • the return value of input asking the user for his or her name.

  • And then on my second line of code, we'll again go ahead

  • and say print hello comma.

  • And then following that, I actually have a choice.

  • I don't necessarily have to just concatenate

  • one input onto that first string hello.

  • I can actually combine these as two separate inputs to print.

  • It turns out that print, like many functions,

  • can take either zero or one or two or any number of inputs.

  • You simply, as the programmer, need to separate them

  • by commas, not the comma that's inside of the quotes.

  • That's just my English grammar separating hello from your name

  • but outside of the quotes.

  • And so my text editor is displaying it a little bit disparately in white.

  • So in this case, am I using the same function

  • print passing in not one but two arguments or inputs, the second

  • of which is that actual name.

  • Now there's a minor bug here that I'm not yet anticipating,

  • but let's see what happens next.

  • If I go ahead now and run Python hello2.py, type in my name, voila

  • it's so close and almost right.

  • But here perhaps is my first bug.

  • It's not necessarily a bug that's breaking the program altogether.

  • But to be a little bit nit-picky, I think we could do a little bit better,

  • say, grammatically there seems to be two spaces instead of one

  • but where are they coming from?

  • Well, it turns out that functions, when they take inputs,

  • can also have default behavior.

  • And in this case, print is designed by its authors

  • whenever you pass in two or more arguments

  • to just separate them by default with one space.

  • Now I can fix this mistake in a couple of ways.

  • But the simplest way is just to remove it from my own input,

  • saving the file again and rerunning Python of hello2.py

  • gives me now what is your name?

  • David.

  • And we're back-- hello, David.

  • Now it turns out there is yet more ways than you can format information

  • on the screen.

  • And just so that we've seen one other way,

  • allow me to create a fourth file called hello4.py.

  • In hello3.py, am I going to start almost exactly the same declaring a variable

  • called name, assigning to it the return value of input,

  • asking the user for their name.

  • And then in my second line of code am I again going

  • to use print starting with a parenthesis and a close parenthesis.

  • In there am I ultimately going to say hello comma something,

  • and the something this time is literally going to be name but not quite name

  • on its own.

  • Indeed, if I were to run this program now,

  • what do you think might be printed?

  • I've said to the program print hello comma name, but unfortunately,

  • name is as written, N-A-M-E. And so I would literally see that on the screen

  • no matter what I typed in.

  • But it turns out in Python, if you surround your variable's name with

  • curly braces, typically found just above the Enter key--

  • at least on a US keyboard-- you can tell Python that this N-A-M-E is actually

  • the name of a variable, not just a string of text on its own.

  • But in order to tell Python that it should treat this string, this input

  • to print a little bit differently than usual,

  • you have to fairly cryptically prefix it with a single F,

  • thereby telling Python that this is what should

  • be called a format string or string--

  • That is, a string of text a sequence of characters

  • that should be treated as formatted in a special way.

  • And according to Python's own documentation,

  • if you format your input using these curly braces inside of which

  • is the name of a variable, Python will go

  • through the trouble of plugging in the value of that variable

  • there and therefore formatting it for you.

  • And so I'll again go ahead and run Python of hello3.pi, input my name,

  • and voila, we're back to Hello, David.

  • So what are the takeaways then from the simplest of programs?

  • Well, we clearly have the ability to print information on the screen.

  • But we have at our disposal a function, a piece of code

  • that someone wrote long before me that takes as input perhaps just

  • a string like hello world.

  • But if you pass it multiple strings will it handle those as well.

  • And if you pass it a special type of string will it handle that as well too.

  • And so depending on the documentation for some language

  • do you have these and so many more features available to you.

  • And so one of the first steps in learning any programming language

  • is not to take a formal lesson or class but simply,

  • quite honestly to read the documentation.

  • And once you have under your belt some knowledge of one or two

  • or few programming languages, it is much easier in the computer world

  • to pick up new ones than I daresay it is in the human world where you

  • have a much larger vocabulary at hand.

  • All right, enough about printing.

  • What more can we do?

  • Well, programming languages can typically

  • handle any number of arithmetic or mathematical operations--

  • addition, subtraction, and many others among them.

  • So why don't we go ahead and create a program that quite simply prompts

  • the user not just for their name or string

  • but rather two inputs, two numbers.

  • And we'll call them more familiarly x and y.

  • I'm going to go ahead and declare a variable called

  • x assigned to it from right to left the result of asking

  • for the user's input of x.

  • And then I'm going to go ahead and do precisely the same thing

  • on another line of code defining a variable called y

  • and assigning to it from right to left the return value of another call

  • or invocation of input--

  • this time, prompting the user for y.

  • And then I quite simply am going to go ahead and print out the result.

  • So print x plus y--

  • this time not using any double quotes at all

  • because what I literally want to print is x plus y,

  • not the concatenation of two strings, not a string at all,

  • but rather just a number.

  • In my terminal window now will I go ahead and run Python

  • on the name of this file, arithmetic.py.

  • I'm prompted for x, which I'll say one.

  • I'm prompted for y, so I'll say two.

  • And indeed, 1 plus 2 is 12, but no 1 plus 2 is not 12.

  • So what's actually happened here?

  • Well, this is the first real bug, if you will, that I've introduced to my code.

  • Now step one here has x equals input, prompting the user for x, and step

  • two has y equals input prompting the user for y--

  • little different from before that we've chosen different variable names.

  • Now on line four--

  • and I've separated with this blank space just

  • to make it a bit easier to read do I have print x plus y.

  • But 1 plus 1 is surely not 12.

  • But what appears to be happening?

  • Well, it's no coincidence that the answer Python's

  • giving me is my first input followed by my second.

  • It would seem, in fact, that Python is contaminating

  • my first input to my second that is joining them just

  • like hello comma David.

  • So why is that happening?

  • Well, I would hope that plus when given two numbers

  • would, in fact, add those two together.

  • But here's a catch.

  • It turns out that underneath the hood, Python, like many languages,

  • actually has what are called types or data types--

  • that is to say, different ways of representing information

  • if that information is a number like an integer

  • or even a real number with a decimal point

  • or a string of text that is a sequence of characters or maybe more

  • generally, a value like true or false, a so-called Boolean value, and even

  • others still.

  • And in this case, indeed, it turns out-- and you would only know this by trial

  • and error by having read the documentation first--

  • does the input function built into Python return

  • to you not a number, but a string.

  • Even though what I typed on my keyboard looks like a number

  • and surely is in practice, it's actually not being stored as such by Python.

  • It's being stored and said in such a way that the computer is interpreting it

  • as a string of text.

  • So we have to be ever more explicit.

  • And indeed, computers don't necessarily know what I intend.

  • Maybe the goal at hand was to write a program that concatenates

  • one number against another.

  • And so if I really want the user's input to be treated as numbers,

  • I somehow have to coerce it to such or convert it

  • or more technically, to cast it from string to an int.

  • Now it turns out Python has other functions with which I

  • can fix this mistake, and I might do this in between these lines here.

  • X should not just be whatever the user typed in.

  • I want x to be the result of converting whatever

  • the human typed in into the integer or into version of that string.

  • Meanwhile, can I fix y by the same y equals int of y,

  • thereby telling Python even though, yes, the human typed something

  • in a keyboard, therefore implying a string,

  • go ahead and convert that one and that two

  • to an actual integer underneath the hood, a pattern of bits, if you will,

  • that represents not the ASCII value or the Unicode value

  • that the user typed in but the underlying pattern of bits

  • that represents one and represents two.

  • Let's go ahead now and rerun this file as Python arithmetic py,

  • inputting again 1, inputting again 2, and lastly, hitting Enter.

  • This time, we have three.

  • Unfortunately, we've paid a bit of a price, albeit just a matter of style.

  • I seem to have increased the length of this program from just three lines

  • to five just to fix a simple mistake.

  • But it turns out that just as functions take input and produce outputs,

  • so can you pipeline them, so to speak, or nest them so that one function's

  • output is another function's input.

  • The result of which is that we can kind of tighten this up.

  • I can instead on line 2 here delete what I have

  • and on line one alone simply say that once

  • I get back the return value of input--

  • the user's input, if you will, return on a conceptual sheet of paper--

  • go ahead and pass it immediately to that function called

  • int and surround the whole thing with parentheses,

  • thereby passing the output of input into the input of int

  • and then assigning the result from right to left.

  • And again, here can I get rid of line 3, passing the output of this input

  • to the input of this int and again surrounding it

  • with parentheses in order to pass its output into its input,

  • ultimately assigning from right to left that value.

  • Let's to be sure go ahead and save this file.

  • And then in my terminal window, run one final time Python of arithmetic.py,

  • inputting 1, inputting 2, and still do we get 3.

  • Suppose, though, that we not only want to take a user's input

  • but make a decision based on it, taking out for a spin

  • this notion of a condition with some Boolean expressions.

  • How might we do that?

  • Well, let me go ahead and create here a file called conditions.py,

  • inside of which I'm again going to ask the user for two inputs,

  • call it x and y, and then I'd like to determine whether x is less than y,

  • greater than y, or exactly the same.

  • Well, as before, I can declare my variable called x, assign to it--

  • preemptively this time--

  • the result of passing to int the return value of input

  • asking the user for just x and then define another variable

  • called y, assigning to it the return value

  • of int, which is passed, in turn, the return value of input,

  • asking the user for y.

  • And now with these two numbers in hand, am

  • I going to proceed to do the following.

  • If x is less than y followed by a colon, indented below that,

  • I'm going to say quite simply, well, you know what?

  • X is less than y, simply to inform the user as much.

  • And then back aligns with the if.

  • Am I going to say else if x is greater than y with a colon

  • and then indented below that, print x is greater than y.

  • Now those are the semantics that I intend,

  • but it turns out you have to read the fine print.

  • In Python, it is not else if that you can use.

  • Humans some years ago decided that slightly more succinct than else

  • if would be literally elif.

  • And that, in fact, is the correct syntax to use when you have a second fork here

  • in the road.

  • But indeed when we have now a third, we might do this else if--

  • there I go again--

  • elif x equals y, let me go ahead and say print x equals y.

  • But there's a bug here already because equal does not mean what you think.

  • Indeed, before we've been using the equal sign as the assignment operator,

  • so to speak, copying from right to left some value.

  • And in fact, here on line 8, there's already a bug.

  • Using a single equal sign between my x and y

  • here would have the effect not of comparing those two values

  • but instead copying from right to left that value in y

  • into that value for x, thereby making them equal no matter what.

  • And so it turns out that we humans painted ourselves

  • into a bit of a corner some years ago.

  • We've already used equal to assign one thing to another.

  • So it turns out that humans in many languages

  • decided well, let's still use equal but two of them back to back.

  • And so if you want to use not the assignment operator but the equality

  • operator, do you want to use two of these things back to back.

  • And so now I have a program that asks quite simply if x less than y, print

  • as much.

  • Elif x greater than y, print as much then.

  • Elif x equals y, print the same, x equals y.

  • But we don't necessarily need this third condition, it turns out.

  • Logically, if you've got two numbers, two integers,

  • I'm pretty sure, by definition, it'll either be greater than 1,

  • less than the other, or exactly the same.

  • And so we can save a tiny bit of time here

  • by not even asking that third question.

  • If I know that x is not less than y and I know that x is not greater than y,

  • I might as well just confer logically that they must be actually equal.

  • And so here we have the first of my Python programs

  • that much like my pseudocode for finding Mike Smith allows

  • me to take users' input and then compare it

  • in order to take a different fork in the road based on its value.

  • So my Boolean expressions here are x less than y and x greater than y

  • and that's it.

  • And my conditions here or the syntax with which I induced these branches

  • are my if, my elif and my else.

  • But important in Python--

  • unlike in pseudocode-- is some of this syntax.

  • The colon very specifically say do the following if this is true,

  • and the indentation, the multiple spaces--

  • here I typed four--

  • is ever so important as well.

  • Whereas many programming languages are a bit loose

  • when it comes to whitespace, so to speak,

  • how many times you hit the spacebar or tab.

  • Python enforces that you have the same amount of indentation everywhere.

  • And so if I want lines four and six and eight here to all line up logically,

  • they must literally do so in the file.

  • Similarly, must lines five and seven and nine line

  • up just right so that they are only executed

  • if the lines are just above them are true.

  • Let me go ahead and save this file.

  • And in my terminal window, run Python of conditions.py,

  • typing in shall we say 1 for x, 2 for y.

  • And indeed, x is less than y.

  • Let's run it again, this time, flipping things around.

  • X is 2.

  • Y is 1.

  • And indeed, x is greater than y.

  • And one third time, which should be inferred if x is 1 and y is 1,

  • then, indeed, x equals y.

  • Now programming is not all about numbers.

  • In fact, pictured here in a program I wrote

  • already is answer.py wherein we have lines of code that, again,

  • prompt the user for input but this time, leave it as a string.

  • And so on line four am I asking the user for his or her answer

  • to a yes or no question.

  • Presumably the user might type little y or little n or perhaps capital Y

  • or capital N.

  • And indeed, I'd like this program to handle any number of those cases,

  • as any program might.

  • On line 7 here am I asking then two possible questions.

  • If c-- the name of the variable I gave to the users' input--

  • equals equals capital Y. Or to be robust, that variable c equals

  • equals lowercase y.

  • Let me go ahead and conclude and print that the user meant yes.

  • Meanwhile, if c equals equals n capitalized

  • or if c equals equals c in lowercase, similarly

  • do I want to conclude that the user meant no.

  • And so here the new operative word is quite literally or.

  • In Python, it tends to be fairly English-like,

  • ever more so than C and other languages where

  • if you want to do something or something else,

  • you literally say quite simply or.

  • If I wanted both situations to be true, albeit illogically,

  • could I use the actual word and.

  • But of course, the user's input can't simultaneously be capital

  • Y or lowercase y.

  • And so here is using or apt.

  • Now when programming, you don't have to use

  • only those functions that are handed to you by the particular language.

  • You yourself can invent functions of your own.

  • For instance, let me go ahead and in a file called

  • return.py implement a program that takes as input an integer

  • or number from the user and then quite simply prints out the square thereof.

  • So if the human types in 2, I'll print out 4.

  • If the human types in 3, I'll print out 9.

  • And so how when might we go about doing this?

  • Well, perhaps in a familiar way now, x shall

  • equal the result of converting to an integer whatever the user's input is

  • after prompting them for x.

  • And then I'm going to go ahead quite simply and print out, well, x times x.

  • That is the square of x.

  • I'll go ahead and save this and in my terminal window

  • run Python on return.py and as proposed, square 2 and as proposed, square 3.

  • And indeed, this program works exactly like this but to square a value

  • has kind of a nice ring to it.

  • And the fact that it happens to be implemented as x times x

  • is really just an mathematical implementation detail--

  • something that I shouldn't really have to worry about or remember.

  • I would just like to square the user's input.

  • So wouldn't it be nice if there were a function in Python--

  • or any language for that matter--

  • quite simply called square.

  • Indeed, I can make that happen, whether or not it exists,

  • and simply define it myself.

  • And so here I'm going to go and do the following.

  • Using Python's keyword called def--

  • short for define--

  • I'm going to go ahead and define a function called square.

  • I'm going to specify to Python that that new function shall take input

  • that I'll arbitrarily but conventionally call n for number.

  • And then with a colon as before and some indentation 2

  • am I'm going to go ahead and return quite simply n times n.

  • In other words, the math is the same.

  • The implementation details are the same.

  • But what's new here is this new keyword return.

  • Just like with the input function built into Python,

  • some human in that function's own implementation

  • had a line of code that said return to the user whatever they have typed in.

  • Here am I doing the same but returning not some users' input but rather

  • n times n.

  • And so you can think of this function called square much like that input

  • function, jotting down on a digital piece of paper that value,

  • handing it back to the caller or whoever uses this function,

  • and letting them use it as they see fit.

  • So now rather than use x times x myself can I more conceptually clearly say

  • square of the user's input x.

  • And the fact that x is not the same as n is quite OK.

  • It is this function square that presumes to call its own input n.

  • I can call my own input to square whatever I want, say, x.

  • So let's go ahead now and run this program and see what happens.

  • Based on these definitions, it would seem that I could square x in this way.

  • If I go ahead and run this program again, Python return.py, and type to,

  • I get more output than I surely intended.

  • In fact, this is the first of my truly bad mistakes.

  • Here do I see what's called traceback, which is sort of a trace

  • or a log of everything the computer tried to do.

  • And you'll see some hints, however, arcane this output, that line two is

  • where my mistake probably is.

  • In particular, I have some kind of name error in Python

  • where the named square is not defined, and yet it's right here.

  • Well, it turns out that Python and a lot of languages

  • take things fairly literally.

  • And if when you're interpreting your file

  • they're reading top to bottom, left to right,

  • unfortunately, it's too late to define square on line four

  • if you yourself want to use it on line two.

  • But we could surely fix this logically.

  • As by moving that code up top down below, defining square at the top,

  • writing my own logic below, now trusting that Python will see square and only

  • use it on line five.

  • Let's go ahead and save this file, rerunning Python of return.py,

  • again typing 2.

  • And voila, now it works.

  • But this isn't quite the most conventional way to solve this problem.

  • As naive as Python is reading your code top to bottom,

  • it's a bit of a regression now, a bit of a mistake

  • that I'm putting the actual code that I care

  • about at the bottom and the actual code that I

  • was trying to abstract away, if you will, and give name to at the very top.

  • At the end of this day, the program I care about

  • are these lines here, not my implementation of square.

  • And so I would actually prefer, albeit a bit nit pickily,

  • to put that code actually where it was.

  • And so if you do that, thereby keeping the main part of your program

  • at the top, as is convention, you still have to solve the same problem.

  • So how might we do that?

  • Well, the Pythonic way or the conventional way in Python

  • is to do this--

  • to define a function that most people call itself main, again,

  • ending it with a colon, indenting below that the lines you have written.

  • And then at the bottom of the file is the very last thing

  • you do, telling Python to call that function called main.

  • Because here now is what Python will do.

  • It will read this file in interpreting it top to bottom, left to right,

  • defining a function called main and then here defining a function called square

  • and then here calling one of those two functions, which

  • in turn, calls the second.

  • But in this way have you taken care to define all of your functions first

  • and never calling any of them until everything's been defined.

  • Now so that you've seen it too, it's not quite conventional to just run

  • main at the bottom.

  • Instead, you'll typically see a more magical incantation like this.

  • If underscore name underscore underscore equals equals quote unquote underscore

  • underscore main underscore colon, indented below that

  • will be your actual call to main.

  • This, for more arcane reasons, ensures that if you're

  • using a small program as part of a bigger one,

  • it won't necessarily get executed at the wrong time.

  • But logically, what the key takeaways here

  • are what you can actually do by defining your own functions.

  • Here too do we have an abstraction.

  • What does it mean to square two values?

  • Simply multiplying one against the other.

  • But it would be nice to just refer to that as a verb unto itself like square.

  • And so by defining this function do we now abstract away that multiplication

  • and just treat this as the idea we actually care about.

  • Now, of course, we're not saving all that much time,

  • and my programs even bigger than need be.

  • But it's demonstrative of this principle of abstracting away implementation

  • details and building your more interesting product on top of work

  • that you or someone else has already done.

  • Now what if we want to do more than just square a number?

  • We instead want to prompt the user for input, convert that input to a number,

  • and then ensure that that number is the type of number we want.

  • Indeed, it's not uncommon in a program to ask the user for a positive

  • integer-- something useful--

  • again and again until he or she provides just that.

  • For instance, the user might type 0 or negative or something else still,

  • but you want to pester them again and again until they

  • provide exactly the input you expect.

  • Much like in a website where you're forced to type a number

  • or email address or something else, similarly

  • can we do that in Python in code.

  • So let's go ahead and do exactly that.

  • And assume for the moment that there exists already a function called, say,

  • get positive int--

  • a function whose purpose in life is to get from the human an

  • integer from one on up.

  • Let me go ahead and preemptively this time

  • to find my own main function with def.

  • And inside of that code, go ahead and declare a variable called I for integer

  • and then just presume for the moment to call

  • a function called get positive int, which itself will take a prompt as

  • before asking the user, say, for I. And what do I

  • want it down to with this number?

  • Well, let's keep it simple for now and just print I itself.

  • But I now need to implement that function called get positive int.

  • So for that, I can use def and say def get positive int.

  • But I need this function to take itself a prompt.

  • And I'm going to go ahead and call it exactly

  • that, which is to say when I call this function,

  • as I've done on line two passing in some string of text

  • that I want the user to see, well, in my definition of get positive int on line

  • five, I need to tell Python to give that input a name,

  • so I can refer to it ultimately.

  • Because, indeed, when I'm going to do after adding that colon

  • and indenting underneath is ultimately, we

  • want to call input, passing in precisely that prompt.

  • After all, get positive int is not going to presume

  • to know in advance what the programmer wants to prompt the user with.

  • Instead, it's going to get it just in time via that input or argument.

  • But I need to pester the user again and again if they don't actually

  • give me a positive int.

  • And so how might I do that?

  • Well, just like in pseudocode, we might define for ourselves loops--

  • blocks of code that do something again.

  • And again as we go to step two, as before, so can I do that in Python

  • in any number of ways, but perhaps the simplest here is this--

  • to simply say you know what, Python?

  • Go ahead and give me an infinite loop while true--

  • while being my operative word here, inducing a loop while something

  • is true.

  • Well, you know what's true always is the word true.

  • And indeed, built into Python are Boolean values--

  • true and false literally-- by definition, capital T and capital F.

  • So by saying while true colon, I'm saying Python, please

  • go ahead and do something again and again until I tell you to stop.

  • Well, what do you want Python to do in this loop?

  • I want to go ahead and declare a variable called say n,

  • assign to it the return value of calling that in function,

  • passing to it the output of input.

  • And then and only then if the human has obliged and given me a positive int,

  • I'll go ahead and say, well, if n greater than zero,

  • go ahead and break out of this loop.

  • So a different approach than we saw in pseudocode where I simply said

  • go to and go to and go to again.

  • Here I've instead said, Python, do this forever until I say break.

  • And only once n is greater than zero, as per the user's input,

  • do I break out of this loop entirely and therefore return

  • when I'm ready that value called n.

  • And so here on my last line of code am I again using return, handing back,

  • if you will, a sheet of paper on which is that number.

  • But I only reach this line 10 after I've said break on line 9,

  • and so does this function get positive in ultimately return exactly that.

  • So as always, let me go ahead now and save this file but only

  • after adding that last cryptic line.

  • If the name of this file is implicitly underscore underscore main,

  • then do I want to go ahead and call main so

  • that we avoid all of those issues of code in the wrong order.

  • I'll go ahead and click Save and do Python of positive.py,

  • providing an input, say, negative 1, being

  • prompted again for a number so negative 2-- still

  • not a positive int nor a zero.

  • But if I finally type of value like one do

  • I actually see the one that I inputted.

  • But it turns out Python supports other types of loops

  • as well, not just via this keyword called while but actually

  • via a preposition called for.

  • For instance, suppose that I want to implement a program that,

  • not unlike a charting program like Excel,

  • prints for me some kind of bar chart.

  • These bar charts will be purely textual using, say,

  • hash marks to represent values.

  • But to do this, I'm going to have to prompt the user for input

  • and then print out precisely that many hashes horizontally.

  • Well, let's see what I get.

  • I'm going to go ahead and as always prompt the user for input.

  • We'll call it say n.

  • And that user's input shall be converted via int after asking them

  • for that value of n.

  • And then once I have that value do I want to iterate

  • that is loop some number of times--

  • some number of times equal to whatever the user's input was.

  • So if the user has inputted one, I'll print just one hash mark.

  • If the user inputs 10, I want to print 10 of those hashes.

  • But how do I do this?

  • A while true loop or forever loop that infinitely loops is probably

  • not the right approach here.

  • But rather I want to iterate some finite number of times.

  • And so a for loop allows us to do exactly that

  • with built-in functionality as follows.

  • Let me go ahead and say Python for I in the range

  • of n, which is the user's input, go ahead per the colon

  • and do the following next.

  • Go ahead and print out a single hash for each value.

  • And so what is this line of code doing?

  • Here on line 3 do I have for I in the range of n.

  • Well, it turns out that range is a function built into Python

  • that returns to you effectively a range of values.

  • By default, that range starts at 0 and goes up to but not through the value

  • you ask for.

  • So if you pass to range the value like 1, you will iterate only one

  • time the range of 0 to but not through 1.

  • If you instead input a value of 10, you'll iterate over a range of 0

  • through 9 up to but not through 10 and get precisely that many hashes.

  • My goal, again, is to print a bar chart of sorts

  • with one hash representing each of these values

  • from left to right, a horizontal bar chart.

  • So let me go ahead and save this file here and in my terminal window

  • run Python of score.py.

  • We'll input a number like 10.

  • And unfortunately, they all seem to be vertical.

  • And if I scrolled up higher in my terminal window would I see all 10

  • but again, one on top of the other.

  • So how do I somehow keep my cursor, if you will, on the same line?

  • Well, all this time, I've been using print.

  • I've been getting a new line for free, so to speak.

  • At the end of printing anything has Python

  • been moving my cursor, not unlike an old school typewriter,

  • to the bottom left of the next line.

  • But sometimes I want my cursor to stay on the same line,

  • even as I do something again and again.

  • And it turns out Python--

  • and knowably know this by having looked at the documentation,

  • therefore is that the print function can take

  • a second input that is not necessarily just

  • some other string you want to print.

  • But instead it's a named parameter--

  • that is, an input that has a predetermined named--

  • in this case called n--

  • that you can set equal to a specific value like nothing.

  • It turns out, albeit non-obviously, that, by default, Python

  • ends each line with a carriage return, if you will, or a blank line,

  • otherwise represented here technically as a backslash n, which itself

  • is technically distinct from an old school carriage return

  • but has the effect of moving that cursor down to the next line.

  • So this is implicit.

  • It would be incredibly annoying if any time you wrote Python code

  • and wanted to print something that you had

  • to type out that sequence of symbols.

  • And so you get those for free, so to speak.

  • But if you want to override that default behavior,

  • you need to instead tell Python's print function, you know what?

  • End your lines with nothing at all, quote unquote with nothing in between.

  • But when I'm done printing all of them, it

  • would be nice to move my cursor to the next line so that my next prompt--

  • that dollar sign we keep seeing in my terminal window--

  • is at least on its own.

  • So I'm going to go ahead and say print open paren closed paren

  • with nothing inside that because if I get for free a blank line,

  • I don't need to pass anything to print.

  • That is as the very last step just going to move my cursor to the next line.

  • So let me go ahead and save this program now and again,

  • in my terminal, window run Python of score.py, typing in this time 10.

  • And there do I get my 10 hashes horizontally.

  • So it turns out Python has what are called types, and the only time you

  • really need to know or care about this is when, frankly, it

  • starts to bite you, like it did us.

  • Indeed, when I asked for the user's input

  • and expecting an integer but the user typed exactly

  • that but I didn't convert it in advance to an int,

  • I got ultimately that weird behavior of concatenating one string to another.

  • So underneath the hood are there are any number of types built into Python--

  • a bool like true/false, integers like numbers, strs or strings of text,

  • and even floats, real numbers that have a decimal point

  • and some number of digits after.

  • But beyond that are more sophisticated data types or data structures still.

  • Dict or dictionary, which is as we'll call it a hash table of sorts,

  • list which can be any number of values back to back,

  • a range of values as we've just seen, or a set wherein

  • you have no duplicate values and a tuple, not unlike x comma y or latitude

  • comma longitude.

  • But it turns out that an appreciation of these types

  • can help you avoid some very serious mistakes in code

  • because it turns out that depending on how you store your data in a computer's

  • memory, you might actually get behavior that you didn't actually intend.

  • For instance, let me quite simply write a program that prints out

  • the value of, oh, say, 1 divided by 10.

  • I have here a file called, say, imprecision that I quite simply

  • am going to do this--

  • prompt the user for an input called x, converting their input to an int,

  • as always, asking for x, and then defining another variable called y--

  • this time, converting to an integer the user's input after prompting for y.

  • And then quite simply, I'm going to print x divided by y.

  • Let me go ahead and save this and in my terminal window

  • run Python of imprecision.py, hitting Enter here,

  • typing in, say, 1 divided by 10.

  • And so 0.1 is the answer, just as you'd expect.

  • Let's go ahead and print out more digits than 1 after that decimal point,

  • just to make sure that 1/10 is indeed 0.1 with implicitly an infinite number

  • of zeros to the right.

  • Well, let me go ahead just for simplicity's sake

  • and first store the value x divided by y and a third variable z.

  • And then in my print statement here, let me print out exactly that z,

  • but let me format it a bit differently than usual.

  • Using Python support for an or format string,

  • using that prefix f, which connotes give me a format string,

  • am I going to print exactly that quote unquote z.

  • But recall that you need to surround it with those so-called curly braces

  • to make clear to Python that you want to plug in its value and not literally z.

  • Well, it turns out there's additional syntax we can use, albeit cryptic,

  • via which to tell Python, yes, print z but to this many decimal places.

  • And the syntax for that is a colon right after the variable,

  • followed by a literal period, and the number of decimal points

  • that you'd like to print.

  • And because this is a so-called floating point value or real number,

  • we need one additional F. Now with this syntax

  • should I be able to print that same value but to a specific number

  • of decimal places.

  • Let's see.

  • In my terminal window, let me go ahead and run Python and imprecision.py,

  • again inputting 1, again inputting 10, and whew,

  • I indeed see point 1 followed by nine more zeros--

  • a total of 10 digits.

  • Well, let me get a little more curious and instead print out, oh, shall

  • we say 20 decimal places--

  • again running my program in precision.py,

  • inputting 1, followed by 10, and all looks almost good until wow, 5, 5, 5.

  • I'm a little curious now as to what's going on.

  • Let me go ahead and run this one last time after printing 30 decimal places.

  • Here I'm going to go ahead and run Python of imprecision.py,

  • hoping that the I'm not going to get worse, and it does.

  • It seems if you look far enough out, you start to see some weirdness.

  • In fact, let me go as far as out as--

  • I don't know-- 55 decimal places, going ahead and running

  • Python of imprecision.py, and putting one in 10.

  • Oh, my god, it does get worse.

  • So it would seem that all of us taught in grade school that one divided by 10

  • indeed equals 1/10 is not quite true.

  • If you look far enough beyond the decimal point, eventually,

  • things go horribly, horribly awry, and that is because computers quite often,

  • as powerful and sophisticated as they are, can't quite do everything

  • and can't quite do everything that we humans do.

  • Now why is this?

  • It would seem that Python is ever so slightly

  • off when it comes to the representation of this floating

  • point or this real value.

  • Now why is that?

  • Well, inside of a computer is hardware like this, RAM or Random Access Memory,

  • which is a little chip of memory inside of your computer wherein files

  • and programs or stored when they're open or running.

  • And inside of each of these black chips is some number of bytes or bits

  • that are ultimately used to represent any of the values in your program.

  • The catch here, though, is that this device, like any physical device

  • in the real world, has only a finite amount of space or capacity, which

  • is to say, no matter how big or expensive this particular RAM is,

  • it has a finite number of bytes--

  • maybe one billion if it's a gigabyte or two billion if it's two gigs.

  • But it's a finite number in total.

  • And by default, what Python and most languages do is

  • they decide a priori how many bits or bytes

  • to use to represent any of the values in your program.

  • And so if your number is so precise or so big

  • that it can't quite be represented in only that many bits,

  • the language, like Python, is going to come as close as it can

  • and represent that value with some approximation.

  • And that's what we're seeing.

  • One divided by 10 is surely a mathematically well-defined number.

  • Indeed, it's 1/10 or 0.1 and mathematically should be 0.10000 ad

  • nauseum infinitely.

  • But in Python, if you're only using, say, 32 or 64 any number of bits,

  • you can't possibly represent the infinite number

  • of numbers that exist in the world and represent all of them

  • perfectly, precisely.

  • To do so, you would surely need an infinite number of bits,

  • and we don't have that in our physical world.

  • And so you have to suffer, unfortunately,

  • this potential for floating point imprecision where values you care about

  • are going to be close but not quite what you

  • intend, unless you, the programmer or designer of the system,

  • are willing to spend more than just 32 or 64 bits

  • but more and more and more and enough that you

  • can get that decimal point and those values as far off to the right,

  • so to speak, as you can.

  • But darn it, if there isn't another problem that derives

  • from precisely the same constraint.

  • Not only can you have imprecision when it comes to floating point values,

  • even integers are potentially flawed, not necessarily in Python

  • because in the latest version of Python have they designed in the language

  • the ability to use as many bits as you need to represent integers--

  • specifically, numbers like negative 1 and 0 and 1 and everything

  • to the left and everything to the right.

  • The language itself will use more and more bits

  • to store exactly the integer you want, but that

  • did not used to be the case in Python and is still not the case

  • in some languages.

  • Some languages are vulnerable to what's called integer overflow whereby

  • if in that language you try to count so high that you need

  • to represent a number that's too big to fit in the amount of storage

  • you've allocated--

  • 32 or 60 or some number of other bits--

  • you're going to overflow the value.

  • The result of which is that you might be going up and up and up and up

  • and representing a bigger and bigger value.

  • But it gets so big that all of the ones in that number become zeros.

  • And somehow accidentally.

  • you end up overflowing and starting all over numerically.

  • Now how might that be?

  • Well, consider a number that's represented with only three digits,

  • and let's start counting, for instance, from 123.

  • Adding ones to that gives you 124, followed by 125, 126, 127, 128, 129.

  • And what do we do in our human mathematical world?

  • Well, if you were about to hit nine and we now need to go to 10,

  • you don't just write 10.

  • Rather you write zero, and you carry the one, so to speak,

  • continuing now with your logic, adding that one to that two, giving you 130.

  • And that's OK.

  • We've stayed within the confines of that three-digit number.

  • But of course, if we go count up long enough, we'll eventually reach 999.

  • But if we have decided to only allocate three digits for this number

  • where or what is going to happen when we add one number to this?

  • Well, you might be inclined to carry the one and carry then one again.

  • And in the world where you have the luxury of pen and paper,

  • you might simply write down 1,000, which is the right value.

  • But if your computer or device is only representing values with three digits,

  • you have overflowed this particular value,

  • and you've overflowed in the sense that even though I have it here

  • on the screen, that doesn't actually have room in which to fit.

  • And so your number 1,000 becomes mistaken for 0 0 0,

  • thereby having you've overflowed and wrapped around from a big number

  • to a small.

  • Now you might think that this is fairly contrived and why would

  • you ever do something so foolish as to only represent numbers

  • with three digits.

  • Well, we humans have done worse.

  • Now it wasn't all that long ago that we humans made precisely this mistake

  • using just two digits to store years.

  • After all, if almost all of your dates start with 1900-something,

  • you might as well just store those last two digits.

  • Unfortunately, by December 31, 1999, were many of us quite a bit nervous

  • that we hadn't found all of the code and all of the devices in the world

  • that we're still using just two digits because, as

  • with integer overflow, if you have a number already counted up to 99

  • and you only have two digits, you might run the risk that 99 rolls over,

  • overflowing to 0 0.

  • And all of a sudden, it is not the year 2000 but 1900 again--

  • quite simply the result of integer overflow

  • by using a fixed amount of memory to represent

  • something and not having anticipated that you might eventually need more.

  • But you can certainly design for this and engineer defenses against this.

  • Indeed, some games have done better than we as a society.

  • Indeed, this game here Lego Star Wars has a point system wherein

  • you can accumulate coins over time.

  • And if you play this game long enough, you

  • can accumulate apparently as many as 4 billion of these coins

  • but unfortunately no more because the engineers

  • who designed this game decided that the maximum number of points you can accrue

  • is just that--

  • 4 billion.

  • But why?

  • Well, it turns out that in many computers and game consoles,

  • it's conventional to store your integers or ints 32-bit values--

  • 32 zeros or ones back to back, which means

  • you have 2 to the 32 possible permutations

  • thereof, which means you have roughly four billion possible values.

  • You actually have a few more than 4 billion,

  • but it's perhaps cleaner in a game to just choose

  • a clean value with lots of zeros.

  • But in this game did they anticipate that you'd play this game too long,

  • and you might eventually overflow.

  • And who knows what might happen to that gamer

  • if he or she plays the game so long, and all of a sudden,

  • their high score becomes zero.

  • So these problems are solvable.

  • You just have to anticipate and actually engineer those solutions.

  • But sometimes we don't, including companies like Boeing.

  • It wasn't all that long ago that the Boeing 747 had a software bug in it

  • whereby the plane's power system might actually

  • turn off while the plane in the worst case were actually flying.

  • One article put it as follows.

  • A model 787 airplane that has been powered continuously for 248 days

  • can lose all alternating current--

  • AC electrical power-- due to the Generator Control Units,

  • GCUs, simultaneously going into failsafe mode, the memo stated.

  • This condition is caused by the software counter

  • internal to the GCUs that will overflow after 248 days of continuous power.

  • Boeing is in the process of developing a GCU software upgrade that

  • will remedy the unsafe condition.

  • Another website analyzed the situation as follows.

  • A simple guess suggests that the problem is a signed 32-bit overflow as 2

  • to the 31st power is the number of seconds in 248 days multiplied by 100--

  • that is, a counter in hundreds of second.

  • Which is to say it is presumed that Boeing had stored some form of integer

  • in its own software, and that integer was

  • representing the number of hundreds of seconds

  • for which the power had been on.

  • But if you're only using 32 bits and only have at your disposal

  • roughly 4 billion one hundredths of seconds, turns out mathematically,

  • after 248 days, that counter, which is clearly important,

  • might overflow and wrap around to zero, the result of which

  • is that the power of an airplane might shut off.

  • And so the temporary work around, before the software upgrade was deployed,

  • was to quite literally reboot the airplane while on the ground

  • before that 248th day.

  • Now we've only just scratched the surface

  • of what you could do with Python, and in fact, we've

  • looked at some of those characteristics that are demonstrative of the features

  • that you might find in any number of programming languages.

  • But we're not even limited to the functions that come

  • built into the core language itself.

  • It turns out there are called libraries and frameworks and yet more

  • in any number of languages that provide additional features that you somehow

  • have to load or import manually in order to use.

  • One such example might be in Python.

  • If you want to generate random or technically pseudorandom numbers

  • to create some kind of variation in how your program behaves,

  • it turns out that you can't just call a random function right out of the box.

  • You need to tell Python to please load or import that feature for you.

  • So let's go ahead and write a program in a file called pseudorandom.py

  • that allows us to generate a random number between, say, 1 in 10.

  • I want to go ahead, though, first and do this.

  • From a library called random, go ahead and import a function

  • called randint for random int.

  • And then if I want to go ahead and generate or select and then

  • print a pseudorandom number between 1 and 10 inclusive,

  • I can simply do print rand int 1 comma 10,

  • and the effect will be to let Python somehow figure out

  • how to choose a number between those two values

  • and return to it to me so that I can print it.

  • Let's go ahead and save the file and then in my terminal window

  • run Python of pseudorandom.py and 10.

  • Let's go ahead and run it again.

  • And this time, I get six.

  • Let's go ahead and run it yet again, and this time, I get 10.

  • Yet again, let's go ahead and run it, and this, time I get one.

  • And if I were to run it shall we say an infinite number of times, over time,

  • I would see a uniform distribution, hopefully, of those 10 possible values.

  • And that's what's meant by pseudorandom itself.

  • It turns out that languages and computers more generally can't really

  • pick a random number off the top of their head like you

  • and I can, rather they need to use algorithms, which themselves

  • are deterministic processes-- code that does the same thing again

  • and again in order to create, if you will, the illusion of randomness

  • by creating a statistically uniform distribution over some range of values.

  • Now often, a computer will use something that's

  • changing, like the clock that's built into it,

  • taking a look at the current time, and then generating

  • a random number or again pseudorandom number based on that variation

  • or if it has a microphone or a camera taking some ambient noise of sorts

  • and using that to feed into whatever algorithm it's

  • using to choose something randomly.

  • But suppose I want to now do something with this value and not just print it.

  • Suppose that we want to implement a bit of a game for a user--

  • pick a random number between 1 and 10 and see if they can guess it correctly.

  • Well, let's see how a program might like that might be implemented.

  • Let's first go ahead.

  • And from that library called random, import as before,

  • a function called rand int.

  • Although it turns out if you want to use not only this function or others,

  • you can more succinctly just say import random.

  • The difference being if you only import random,

  • we are going to have to prefix with the word random followed

  • by a dot every use of a function.

  • So we'll do it that way this time.

  • Let's go ahead and declare a variable called n

  • and assign to it right to left the result of calling rand int.

  • But this time, because we've not mentioned rand int by name,

  • I need to qualify this symbol and say random dot rand int,

  • thereby making clear to Python that the function I'd like you to call

  • is actually inside of that library called random.

  • But I can otherwise use it as before passing in two values--

  • a lower and upper bound like this, one comma 10--

  • and that should give me an n, a random number in that range.

  • Now I want to go ahead and ask the user for their guess,

  • and I'll go ahead and define another variable called, say, guess.

  • That is the result of converting to an int whatever

  • the user's input is for their guess.

  • And now, quite simply for this game, I want to compare those two values

  • and print, say, correct or incorrect.

  • And so I'll go ahead and say if the users guess equals equals n,

  • go ahead and print out as much correct.

  • Else if the user's guess is not right, let's implicitly

  • infer that, nope, incorrect.

  • And so we'll print that instead.

  • Let me go ahead and save this file now and run Python of guess.py.

  • This time, I'll be prompted for my guess.

  • I'll say five.

  • Unfortunately, it's incorrect.

  • Let's go ahead and play again this time running Python guess.py.

  • This time, I'll go with 10 since we saw it so many times before and correct.

  • I'll run it a third time and see what's going on.

  • This time, I'll guess one but incorrect.

  • Unfortunately, I've implemented perhaps the most frustrating game ever

  • because I'm not even telling the user what the actual number was.

  • But surely, I could do that by printing the value of the variable

  • I stored that random int in, but that would be yet another game altogether.

  • All right, let's now bring all of this together and solve an actual problem--

  • one say from yesteryear.

  • You might recall this game here, Super Mario Brothers, the original, and this

  • was a two-dimensional world built up of images like this.

  • Well, there in the sky so, to speak, do I see four question marks.

  • And that seems like an opportunity to do something again and again.

  • How might I print out for question marks in a row, not nearly as graphically

  • as that here but just with my terminal window and text editor?

  • Well, here let me go ahead and open those two.

  • And in a file called, say, mario.py, let me go ahead and print out,

  • quite simply, four questions.

  • I'll go ahead and print out quote unquote question mark question mark

  • question mark question mark.

  • Saving that file again in mario0.py, running in my terminal window

  • Python of mario0.puy and voila do I get an approximation

  • of what Nintendo did in yesteryear.

  • Now of course, doing something again and again

  • is clearly an opportunity for, say, a loop and not just printing it

  • all at once but just doing something again and again.

  • And while this will complicate the code initially,

  • it sets us up for a more interesting solution thereafter.

  • In a file then called mario1.py, let me go ahead

  • and implement that same sequence of question marks

  • but this time using that familiar loop, not a while loop or an infinite loop

  • but perhaps just a for loop like this.

  • For I in the range of 0 up to, but not through 4,

  • go ahead and print out just one question mark.

  • Saving that file brings me now to my terminal window in Python of mario1.py

  • enter.

  • Unfortunately, I have created not quite the right level.

  • But that's OK because remember that with print, you

  • get one new line for free every time you call it,

  • unless you override that default behavior.

  • So let's say, no, Python instead and your line with nothing,

  • and only once I'm completely done do I want you to print

  • one of those free new lines for me.

  • I'll go ahead and save my file again in my terminal

  • window, rerun Python of mario.py, and now we have the same exact result.

  • But later in the game do we see different aspects of Mario's world, not

  • unlike this thing here underground.

  • Pictured here are a number of blocks in the underworld,

  • and it looks to me like that bigger block

  • there is a composition of, say, 4.

  • So let's go ahead now and print out a block of bricks,

  • these are all represented by hashes so that I have four horizontally,

  • four vertically as well, and everything else filled in too.

  • We've not yet printed out anything on multiple axes,

  • if you will, both rows and columns of sorts.

  • So how to do this?

  • Well, in my terminal window, I'm going to create a file called mario2.py.

  • And in this file, I'm going to decompose this problem conceptually, so to speak,

  • into two different problems.

  • Built into that underworld are some number

  • of rows of bricks within which are these columns.

  • And I bet I could bite those off each one at a time.

  • So let me go ahead and do this.

  • For I in range of 4, go ahead and print what?

  • Well, for every row of bricks, do I want to print some number of columns too?

  • Because it's a square, the same number of columns as rows and so I

  • know how to print that many things too.

  • I can simply use another loop perhaps with a different variable

  • with which to count like for j, as is conventional in range also of 4.

  • And then inside of this inner nested loop, so to speak,

  • might I go ahead and print out just one hash ending each of my lines

  • with, as before, nothing.

  • In fact, I only want to move my cursor to the next line

  • after I've printed each of those columns.

  • And so only underneath that innermost loop do I want that call to print.

  • Let me go ahead and save this file now and run Python of mario2.py.

  • And if I've gotten this logic right, I can go ahead and print out

  • those rows and columns too.

  • And indeed, that's what I get.

  • It's not quite a perfect square because those hashes are

  • a little taller than they are wider.

  • But via a for loop, one nested inside of another can

  • I handle two problems at once--

  • the act of iterating from row to row to row and within each row, iterating

  • left to right via column.

  • And in fact, to make that ever more clear, why

  • do I even call my variables more aptly.

  • For each row in the range of four in each column in the range of four,

  • go ahead and print each of those hashes.

  • Frankly, it doesn't even matter what I call these variables because I'm never

  • actually using them per se.

  • I'm simply telling Python to count up from 0 to 4 using

  • those particular names.

  • Those then are some programming languages--

  • Python especially among them.

  • And just as we saw in pseudocode, the ability to express functions

  • and conditions with Boolean expressions and loops and then things like

  • variables and more, so can we express those exact same ideas in Python, in C,

  • C++, and Java.

  • And with each of those languages do you get different features,

  • with each of those languages do you get different techniques.

  • But ultimately, those languages are all just

  • tools for one's toolkit with which to solve any number of problems with data.

[MUSIC PLAYING]

字幕と単語

ワンタップで英和辞典検索 単語をクリックすると、意味が表示されます

B1 中級

弁護士のためのCS50 2019 - プログラミング言語 (CS50 for Lawyers 2019 - Programming Languages)

  • 6 0
    林宜悉 に公開 2021 年 01 月 14 日
動画の中の単語