This lab is a collection of C programming exercises with an emphasis on strings (i.e., arrays of characters) and memory management. The first is an introduction to a tool for finding memory leaks, and the second is a simple exercise on 1-D arrays of characters.
For more information (including information on how to use `valgrind`), see
the pre-lab.
Each of these exercises comes with a set of tests implemented using the
Google Test framework for C, aka `gtest`. You won't have to learn anything
about `gtest`, but you will need to be able to compile and run the tests that
we provide.
We think the tests are pretty reasonable, but make no promises that they are in any way complete. Obviously you want your code to pass at least these tests, but you shouldn't assume that passing these tests guarantees any kind of correctness. You're welcome to read the tests and extend them if you'd like. You may even need to make changes to the test code to handle memory leaks (see "fixing memory problems" below). Do be careful to not remove or weaken the tests, though; at a minimum you definitely want to be able to pass the tests as given.
Passing the tests is arguably just the first half of each of these problems,
as it's entirely possible to pass the tests but still have substantial memory
management problems. You could, for example, have an illegal memory access
that passes the tests through happenstance, because the right value happens
to be in the necessary location. Or you could have memory leaks because you
never free up any of the memory you allocate while solving the problem in a
manner that is otherwise entirely correct. This is where `valgrind` is
extremely helpful, as it will help identify these kinds of memory problems
even if the tests pass.
If the test code calls a function `f()` that returns an array or string that
is allocated somewhere in `f` (or a function `f` calls), then that memory is
lost if the test code doesn't free that returned array. So if `valgrind` says
there's a leak where some memory is allocated in a function and then returned
to the test code, then the fix is in the test code. In general we don't
encourage you to fiddle with the test code (you could always just change the
test code to say everything passes!), but if the memory leaks to the test
code, then that's where the fix has to be made.
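The ownership rule above can be sketched in C as follows. This is only an illustration and not code from the lab: `make_copy` and `use_and_release` are hypothetical names standing in for a function under test and for the test code that calls it.

```c
#include <stdlib.h>
#include <string.h>

/* Hypothetical stand-in for a function under test: it returns a
 * heap-allocated copy of `text`, so the caller owns the result. */
char *make_copy(const char *text) {
    size_t length = strlen(text);
    char *copy = malloc(length + 1);   /* +1 for the '\0' terminator */
    if (copy == NULL) {
        return NULL;
    }
    memcpy(copy, text, length + 1);    /* copies the terminator too */
    return copy;
}

/* Hypothetical stand-in for test code: it uses the returned string
 * and then frees it. Without the free(), the block allocated in
 * make_copy() would never be released. */
size_t use_and_release(const char *text) {
    char *copy = make_copy(text);
    if (copy == NULL) {
        return 0;
    }
    size_t length = strlen(copy);
    free(copy);                        /* the caller's responsibility */
    return length;
}
```

The function that allocates cannot free the memory itself, because the caller still needs to read the result. That is why the fix for this kind of leak belongs in the calling code.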
There are several directories here, one for each project. We would recommend doing them in the order listed below; there's no overwhelming reason that you need to do them in any particular order, however, and it would be far better to move on to the next one rather than get stuck in one and not make any progress.
The basic structure for each part of the project is (for an imaginary
part `frogs`):

- `frogs.h`, which specifies the name, arguments, and return type of the function(s) you're supposed to write.
  - In every case we wrote one or more helper functions, but these don't have to be included in the `.h` file unless you want to include them in the tests.
- `frogs.c`, which includes the initial stub (i.e., a simple incorrect version) of the program you're working with in that part.
- `main.c`, which gives you a "main" function that you can use to run your code independently of the test code. This is how people would normally run your program outside of the test harness, so you want to make sure that it doesn't have any memory leaks. You might also find running your code via `main` to be useful in debugging.
- `frogs_test.cpp`, which is the test file we wrote using `gtest`. The `.cpp` ending is because this is actually a C++ file, not a strict C file. That will affect how you compile the test code, but you won't have to know/learn anything about C++ for this lab.
Your job then is typically to complete or fix `frogs.c`, providing
implementations of the functions listed in `frogs.h`.
To compile the `main` program, use the following:
gcc -Wall -g -o frogs frogs.c main.c
(where you replace `frogs` with the appropriate name for the project
you're working on). If all goes well, that should generate an executable
`frogs` that you can run with `./frogs`.
To compile the test code use the following:
g++ -Wall -g -o frogs_test frogs.c frogs_test.cpp -lgtest -pthread -std=c++0x
Notice that this uses `g++` instead of `gcc`. This is because `gtest`
is technically a C++ library, but it also works for "plain" C code, which
is all we need it for here.
There are several flags here:
- The `-Wall` flag turns on all warnings (`W` is for "warnings", `all` specifies that we want all of them). This isn't necessary, but the C compiler will quietly do very unexpected things in a variety of circumstances, so it's almost always a good idea to turn on all the warnings so you get a heads up when something odd is happening.
- The `-g` flag isn't strictly necessary; it causes a variety of useful debugging information to be included in the executable, however, which can be extremely helpful when using tools like `valgrind` or the `gdb` debugger. If you don't include it, for example, then those tools won't be able to report accurate or useful line numbers or function names.
- The `-lgtest` flag tells the compiler to include the `gtest` library (that's the `-l` part) when generating the executable.
- The `-pthread` flag tells the compiler to use the PThread threading library for C/C++. The test library uses pthreads, and you need to tell the compiler that so it uses the right threading tools.
- The `-std=c++0x` flag tells the compiler which version of C(++) you want to use. There have been a number of versions of C/C++ over the past few decades (similar to the different versions of Java), and this says we want to use the `c++0x` standard, the working name for what became C++11 (the `0x` reflects the expectation that it would be finalized in the 2000s). That's necessary or the test code won't compile correctly.
We've set up GitHub Actions to automatically run six tests whenever you push new code up to your repo. All of these are also reflected (perhaps with some delay) in the badges at the top of this README.
- `palindrome-gtest` runs the GTest tests on the Palindrome problem; the badge is red if any of those tests fail.
- `palindrome-main-valgrind` runs `valgrind` on the `main` program for the Palindrome problem; the badge is red if there are any memory leaks in running `valgrind` on that executable.
- `palindrome-test-valgrind` runs `valgrind` on the executable containing the GTest tests; the badge is red if there are any memory leaks in running `valgrind` on that executable.
There are then the same three tests, but for the Disemvowel problem.
- Pass our tests.
- Have no memory leaks, as confirmed by `valgrind`.
- Remove any print statements that you used to debug your code before you turn it in.
Also, please don't lose your brains and forget good programming practices just because you're working in a new language. C can be quite difficult to read under the best of circumstances, and using miserable names like `res`, `res2`, and `res3` doesn't help. Use functions to break up complicated bits of logic; it's really not fun when a group turns in a solution that is one huge function, especially when there are several instances of repeated logic.
Some things to watch out for:
- In the past there has been a strong inverse correlation between length and correctness on these problems. If you find yourself wandering off into 2 or (especially!) 3 pages of code for any of these, you've likely lost the plot and should probably ask for some help.
- Make sure you initialize all variables (including variables used to index arrays in loops). C won't give you an error if you fail to initialize something, and sometimes you can get lucky and your tests will accidentally pass because, at least that one time, you happened to get the "right" initial value. That doesn't mean your code is correct, though.
- Make sure you allocate space for the null terminator `\0` when allocating space for strings.
- The "main" functions (`main.c` in both projects) expect you to type input at the keyboard and will keep running until you "tell" them that you're done typing. Use `^D` to tell the system that you're done typing; see `Tips_and_suggestions.md` for more on this. This also applies when you're running `valgrind` on the main functions.
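As a small illustration of the initialization and null-terminator points in the list above, here is a minimal sketch. The function `repeat_char` is a hypothetical helper invented for this example; it is not part of either project.

```c
#include <stdlib.h>

/* Hypothetical helper: builds a new string made of `count` copies
 * of the character `c`. The caller must free() the result. */
char *repeat_char(char c, size_t count) {
    /* Allocate count characters plus one for the '\0' terminator. */
    char *result = malloc(count + 1);
    if (result == NULL) {
        return NULL;
    }

    size_t i = 0;                 /* index is explicitly initialized */
    while (i < count) {
        result[i] = c;
        i++;
    }
    result[count] = '\0';         /* terminate the string */
    return result;
}
```

Allocating only `count` bytes here would make the final write land one byte past the end of the block, which is the kind of illegal access that can go unnoticed in tests.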
There are more comprehensive tips and suggestions in
`Tips_and_suggestions.md` in the repository.
Before you start writing your own C code, we'll start by using `valgrind`
to identify memory leaks in an existing program. In the `palindrome`
directory there is a program that determines (in sort of a dumb way) if a
string is a palindrome. The file `palindrome.c` has the code that checks for
palindromes and (instead of doing the more obvious thing of returning a
boolean) returns the string "Yes" or "No". The file `palindrome_test.cpp`
uses the Google Test (`gtest`) library mentioned above to test that the
`palindrome` function works on a variety of examples.
To run the tests you should go into the `palindrome` directory in your
project and compile the program:
g++ -Wall -g -o palindrome_test palindrome_test.cpp palindrome.c -lgtest -pthread -std=c++0x
Run the resulting executable and verify that all the tests pass.
You also want to make sure there aren't any memory leaks in `main.c`, and it
might be fun to be able to run it "by hand" so you can type random things at
it. So compile `main`:
gcc -Wall -g -o main main.c palindrome.c
You can run this with `./main` and then type lines of text. The program will
then tell you whether each line you entered is or isn't a palindrome. You can
use `^D` to end the input, or `^C` to kill the program. You can also run it
on some "canned" input using `./main < sample_input.txt`.
Look at the code a little and see if you can spot any obvious memory leaks.
Run `valgrind` on both your test executable
valgrind --leak-check=full ./palindrome_test
and the `main` executable, e.g.,
valgrind --leak-check=full ./main < sample_input.txt
and see what it tells you about memory leaks in this code. Then go through
and fix the memory leaks so that `valgrind` is happy (and the tests still
pass). As mentioned above, this might involve making changes to
`palindrome.c`, or to `main.c`, or to `palindrome_test.cpp`, or some
combination of the three files.
This part is complete when `valgrind` says that both `palindrome_test` and
`main` are free of memory leaks.
The output of `valgrind` can be a bit overwhelming sometimes, so definitely
ask questions if you're not sure how to interpret what you're seeing.
"Disemvoweling" is the act of removing all the vowels ('a', 'e', 'i', 'o', and 'u', both upper and lowercase) from a piece of text. Your task here is to write a function
char *disemvowel(char *str);
that takes a null-terminated string, and returns a new null-terminated string (i.e., it doesn't modify the original string) that contains the same characters in the same order, minus all the vowels. Note that the resulting array of characters will need to be allocated, and will typically be shorter than the input string. It would be desirable to not waste memory and only allocate what you actually need for the return string.
So if the input string is `"goose"` you'd want to return the string `"gs"`,
and you'd want to make sure you only allocated three characters for the
result (two for 'g' and 's' and one for the `\0` at the end).
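One common way to allocate exactly what is needed is to make two passes: count the characters that will be kept, then allocate and copy. The sketch below applies that idea to a different, deliberately unrelated task (keeping only the digits of a string), so `count_digits` and `keep_digits` are hypothetical names and this is not the lab solution or necessarily the approach the authors took.

```c
#include <ctype.h>
#include <stdlib.h>

/* First pass: count how many characters of `str` are digits. */
size_t count_digits(const char *str) {
    size_t count = 0;
    for (size_t i = 0; str[i] != '\0'; i++) {
        if (isdigit((unsigned char) str[i])) {
            count++;
        }
    }
    return count;
}

/* Second pass: allocate exactly count + 1 bytes and copy the digits.
 * The input is not modified; the caller must free() the result. */
char *keep_digits(const char *str) {
    size_t count = count_digits(str);
    char *result = malloc(count + 1);      /* +1 for the '\0' */
    if (result == NULL) {
        return NULL;
    }

    size_t next = 0;                       /* next free slot in result */
    for (size_t i = 0; str[i] != '\0'; i++) {
        if (isdigit((unsigned char) str[i])) {
            result[next] = str[i];
            next++;
        }
    }
    result[next] = '\0';
    return result;
}
```

The cost of this design is walking the input twice; the benefit is that the allocation size is known before any copying starts, so no memory is wasted and no resizing is needed.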
Again, make sure that both `disemvowel_test` and `main` are free of leaks.
You might find `valgrind` useful for helping check for leaks.
Your final repository should have:
- Updated palindrome code that still passes the tests, but has no memory leaks according to `valgrind`.
  - This could involve changes to any and all of `palindrome.c`, `palindrome_test.cpp`, and `main.c`.
- A working implementation of disemvowel that passes all the tests and has no memory leaks according to `valgrind`.
  - As well as completing `disemvowel.c` this could involve changes to `disemvowel_test.cpp` and/or `main.c`.
You shouldn't need to change any of the `.h` files for either part.