Ruby: Procs, Lambdas and Bindings

Procs

Yielding to blocks from methods is the simplest way to access closures. However, yield is a bit limited, because it can only invoke the block directly. It can’t send the block somewhere else to be invoked. To do that, you need a Proc object.

The Proc object wraps a block in a first-class function context. First class functions can be:

Passed as an argument
Returned as a value
Assigned to another variable

The Proc object exposes a call method that invokes its block.

To create a Proc object, you can use either of these two syntax variations:


my_proc = Proc.new do
  puts "Hello, I'm a block.
end

my_proc = proc { puts "Hello, I'm a block." }

my_proc = Proc.new do

puts "Hello, I'm a block.

end

my_proc = proc { puts "Hello, I'm a block." }

Convention is to use proc (and {}) with single-line blocks, and Proc.new (and do end) with multi-line blocks.

Here’s a bit of code that will show that procs are closures and that we can pass them as an argument:


def test_proc(a_proc)
  if defined?(message).nil?
    puts "Nope, can't see message variable from here."
  end

  a_proc.call # Proc has access to message variable, even from inside method.
end

message = 'This is a message from your friendly neighborhood Spiderman.'
msg_writer = Proc.new { puts message } # message variable is in scope for this proc.

test_proc msg_writer
#=> Nope, can't see message variable from here.
#=> This is a message from your friendly neighborhood Spiderman.

def test_proc(a_proc)

if defined?(message).nil?

puts "Nope, can't see message variable from here."

end

a_proc.call # Proc has access to message variable, even from inside method.

end

message = 'This is a message from your friendly neighborhood Spiderman.'

msg_writer = Proc.new { puts message } # message variable is in scope for this proc.

test_proc msg_writer

#=> Nope, can't see message variable from here.

#=> This is a message from your friendly neighborhood Spiderman.

Line 7 invokes the Proc#call method. This invokes the proc’s block and executes the puts message command. Although the #test method cannot directly see the message variable, the Proc object assigned to the a_proc parameter can. This is because blocks are closures, and the message variable is in the block’s lexical scope. The lexical context is the context from the written standpoint. Since we initialize message on line 10, the block on line 11 can see it. Since the block as written can see message, message is in the block’s lexical scope.

We make this distinction because we actually invoke the block on line 7, where the interior of the #test_proc method’s scope is in force. This is not the same scope as the main area, so when we say that the block carries with it its lexical scope, we are saying that the block has the scope of where we create it rather than the scope of where we invoke it.

Lambdas

A lambda is a specific type of Proc object. To create a lambda, you can use either of two syntax variations:


my_proc = lambda do
  puts "Hello, I'm a block."
end

my_proc = -> { puts "Hello, I'm a block." }

my_proc = lambda do

puts "Hello, I'm a block."

end

my_proc = -> { puts "Hello, I'm a block." }

Convention is to use -> (and {}) with single-line blocks, and lambda (and do end) with multi-line blocks.

This code shows that lambdas are a specific type of Proc object:


my_proc = Proc.new { puts "Hello, I'm a block." }
puts my_proc

my_proc = -> { puts "Hello, I'm a block." }
puts my_proc 

#=> &lt;Proc:0x000000010902f0a8 test.rb:1>
#=> &lt;Proc:0x000000010902eb80 test.rb:3 (lambda)>

my_proc = Proc.new { puts "Hello, I'm a block." }

puts my_proc

my_proc = -> { puts "Hello, I'm a block." }

puts my_proc

#=> <Proc:0x000000010902f0a8 test.rb:1>

#=> <Proc:0x000000010902eb80 test.rb:3 (lambda)>

Both the proc and the lambda are Proc object instances. The only difference is that in the lambda, the lambda flag is set. The Proc object exposes a #lambda? method that returns true if the Proc instance is a lambda.

Lambdas behave differently from regular Proc objects in two ways:

Doing a return from a block wrapped in a Proc returns in the context in which the Proc was created. Doing a return from a block wrapped in a lambda returns to the context in which the lambda was called.
Procs don’t check whether the #call method passes the right number of arguments to the block. Any missing arguments are assigned the value of nil, and any extra arguments are ignored. Lambdas throw an ArgumentError if #call passes the wrong number of arguments.

Difference One

Let’s look at difference number one first. Here’s how the Proc object behaves:


def test_proc(a_proc)
  a_proc.call
  puts 'Ok, proc is all done.'
end

message = 'This is a message from your friendly neighborhood Spiderman.'

msg_writer = Proc.new do
  puts message
  return
end

test_proc(msg_writer)
puts 'Goodbye!'

#=> This is a message from your friendly neighborhood Spiderman.

def test_proc(a_proc)

a_proc.call

puts 'Ok, proc is all done.'

end

message = 'This is a message from your friendly neighborhood Spiderman.'

msg_writer = Proc.new do

puts message

return

end

test_proc(msg_writer)

puts 'Goodbye!'

#=> This is a message from your friendly neighborhood Spiderman.

The return on line 11 returns out of the entire program, so lines 4 and 15 never run.

Now, let’s look at how the same code behaves when written as a lambda:


def test_proc(a_proc)
  a_proc.call
  puts 'Ok, proc is all done.'
end

message = 'This is a message from your friendly neighborhood Spiderman.'

msg_writer = lambda do
  puts message
  return
end

test_proc(msg_writer)
puts 'Goodbye!'

#=> This is a message from your friendly neighborhood Spiderman.
#=> Ok, proc is all done.
#=> Goodbye!

def test_proc(a_proc)

a_proc.call

puts 'Ok, proc is all done.'

end

message = 'This is a message from your friendly neighborhood Spiderman.'

msg_writer = lambda do

puts message

return

end

test_proc(msg_writer)

puts 'Goodbye!'

#=> This is a message from your friendly neighborhood Spiderman.

#=> Ok, proc is all done.

#=> Goodbye!

With a lambda, the return on line 11 returns to line 4, the line after the lambda proc call, and execution continues from there. Therefore, lines 4 and 15 run.

Difference Two

Now, let’s look at the second difference. Consider this code:


msg_writer = Proc.new do |name, msg|
  p "Hello, my name is #{name}."
  p msg
end

msg_writer.call 'Blockhead'
msg_writer.call 'Blockhead', 'I am a block.', 'I hope you like blocks!'
#=>"Hello, my name is Blockhead."
#=>nil
#=>"Hello, my name is Blockhead."
#=>"I am a block."

msg_writer = lambda do |name, msg|
  p "Hello, my name is #{name}."
  p msg
end

msg_writer.call 'Blockhead'
#=>wrong number of arguments (given 1, expected 2) (ArgumentError)

msg_writer = Proc.new do |name, msg|

p "Hello, my name is #{name}."

p msg

end

msg_writer.call 'Blockhead'

msg_writer.call 'Blockhead', 'I am a block.', 'I hope you like blocks!'

#=>"Hello, my name is Blockhead."

#=>nil

#=>"Hello, my name is Blockhead."

#=>"I am a block."

msg_writer = lambda do |name, msg|

p "Hello, my name is #{name}."

p msg

end

msg_writer.call 'Blockhead'

#=>wrong number of arguments (given 1, expected 2) (ArgumentError)

From this, you can see that the Proc objects created with Proc::new assign nil to missing arguments and silently ignore extra ones, while objects created with Kernel::lambda throw an ArgumentError when invoking #call with the wrong number of arguments.

Bindings

Consider this code:


def test_proc(a_proc)
  a_proc.call
end

message = 'You can see me!'
msg_writer = Proc.new { puts message }
message = 'No, you really can see me!'

test_proc(msg_writer) #=> No, you really can see me!

def test_proc(a_proc)

a_proc.call

end

message = 'You can see me!'

msg_writer = Proc.new { puts message }

message = 'No, you really can see me!'

test_proc(msg_writer) #=> No, you really can see me!

Notice that the closure msg_writer, even after it has been created, keeps track of the change to the enclosed variable message.

How? Well, in Ruby, everything is an object. Perhaps we can just keep track of a reference to the message object? Let’s see:


def test_proc(a_proc)
  a_proc.call
end

message = 'You can see me!'
puts message.object_id

msg_writer = Proc.new { puts 'Inside block: ' + message.object_id }

message = 'No, you really can see me!'
puts message.object_id

#=> 60
#=> 80
#=> Inside block: 80

def test_proc(a_proc)

a_proc.call

end

message = 'You can see me!'

puts message.object_id

msg_writer = Proc.new { puts 'Inside block: ' + message.object_id }

message = 'No, you really can see me!'

puts message.object_id

#=> 60

#=> 80

#=> Inside block: 80

Well, no we can’t. In Ruby, everything is an object, but also in Ruby, every time you reassign a variable, Ruby creates a new object. That’s why lines 7 and 12 have different object ids. So there has to be another mechanism to create closures. That mechanism is a binding.

Binding internals

At a low level, a binding is an object that wraps a stack frame.

A frame is a C struct. A stack frame is a frame whose members contain the state of one of the contexts that is currently executing. For example, in the above code, we have the state that is visible to #test_proc, and we have the state that is visible to main. These two sets of state are in two different frames.

When main or a method executes, a frame gets pushed onto the stack. When execution completes, the frame gets popped off of the stack.

This mechanism won’t work for closures. One frame on the stack can’t see another one, and the closure’s state has to persist beyond the context of the frame in which it’s created. What will work is wrapping the frame in an object and storing it on the heap. An object has a life of its own, so a frame wrapped in an object will persist even when the lifetime of the context that spawned the frame ends. This is why all blocks have a binding associated with them.

The `Binding` Class

The Binding class defines the binding. Whenever a block is created, an instance of the Binding class (called, helpfully, binding) is created to go along with it. This object keeps track of any changes to the state of the closure. As the doc says, “Objects of class Binding encapsulate the execution context at some particular place in the code and retain this context for future use.” When there is a request for any value contained in a block’s closure, the request is passed to the binding, which locates and accesses the value.

We can use the Binding object to get a bit of insight into how the binding works.


x = 'You can see me!'
a_proc = Proc.new { puts x }

puts "Local variables: #{a_proc.binding.local_variables}"
puts "Value of x: #{a_proc.binding.local_variable_get(:x)}"
puts

x = 'No, you really can see me!'
puts "Changed value of x: #{a_proc.binding.local_variable_get(:x)}"

#=> Local variables: [:x, :a_proc]
#=> Value of x: You can see me!
#=> 
#=> Changed value of x: No, you really can see me!

x = 'You can see me!'

a_proc = Proc.new { puts x }

puts "Local variables: #{a_proc.binding.local_variables}"

puts "Value of x: #{a_proc.binding.local_variable_get(:x)}"

puts

x = 'No, you really can see me!'

puts "Changed value of x: #{a_proc.binding.local_variable_get(:x)}"

#=> Local variables: [:x, :a_proc]

#=> Value of x: You can see me!

#=>

#=> Changed value of x: No, you really can see me!

We can see from this code that the Binding object exposes the local variables as symbols with the same name as the variables (including — as always — a reference to self, which in this case is :a_proc).

This article is one of a series of four. Here are the other three:
Ruby: Blocks
Ruby: Scope and Closures
Ruby: Block Parameters and Return Values

Robert Rodes

Software Developer

Turning ideas into software...

Robert Rodes

Ruby: Procs, Lambdas and Bindings

Procs

Lambdas

Difference One

Difference Two

Bindings

Binding internals

The `Binding` Class

Related Articles

Procs

Lambdas

Difference One

Difference Two

Bindings

Binding internals

The Binding Class

Related Articles

The `Binding` Class