Is it possible with current 3D printers to print a sound trace?

Question

Is it possible with the accuracy of current 3D printers to print a sound trace?

On a vinyl record the grooves in the record are an encoded sound. Is something like this doable with 3D printers?

If Vinyl-like isn't possible, could a sound be printed at desktop scale? I mean printing the waves out that if you ran your finger along it it would reproduce the encoded sound? Examples would be Rumble Strips, the Musical Roads or highway rumble strips.

The practicality and the quality would depend on the size of the needle (and the RPM) that is used to play the sound back. I would guess that it is possible but the sound quality would be so poor that it wouldn't be worth it, except for academic purposes. Or are you asking if it is possible to print to a resolution comparable to that of a conventional gramophone record? If the latter, then I would guess not (yet). You'd need to compare the physical resolution (in microns) of record grooves to the smallest movement possible on a 3D printer. IDK but I'd guess they are at least a magnitude apart. — Greenonline, Oct 11 '19 at 22:07
Just a comment, since I feel like adding this as my own answer would plagiarize what's already here, but I also want to sum up the two answers so far, since each covers a different aspect of the question. In short, we're getting close, but current technology can't do the equivalent of vinyl record playable on your parent's hi-fi phonograph machine. The grooves are too fine. But you _**could**_ print a similar device, where the grooves are much wider, capable of reproducing intelligible human speech or even music, albeit at lower quality than vinyl. — Joel Coehoorn, Oct 14 '19 at 14:37
@JoelCoehoorn No reason the fidelity can't be decent so long as you allow a faster speed (think 78 vs 33&1/3) and a larger vertical displacement. — Carl Witthoft, Oct 15 '19 at 17:43

score 7 · Answer 1 · edited Jun 18 '20 at 08:29

Sound Encoding basics

Sound is a compression wave, and any depiction of it has to be an encoding of it. You can encode it so you can recreate the sound using a contraption that oscillates in the right way to compress air again in the right pattern, but you can't just "print it out" like you can scale up a lightwave from the nanometer scale to a visible one as a representation.

Let's take a simple example: a 440 Hz tune is generally considered to be the A₄, aka concert pitch a or A440.

It could be encoded in a various ways. The probably oldest is to encode it as a note in violin notation, which then could be reproduced by anyone using a properly tuned instrument. The actual result depends on the instrument used as much as on the skill of the player. Each instrument thus might decode this encoded note differently, based on the physical setup of the instrument. Each instrument automatically creates the appropriate overtones.

In Midi, it is encoded as Note 69 and any machine that can decode a midi file could use this instruction, paired with an instrument to use, to create the A₄ that is set for it. In Midi, the mere instruction of Note 69 does cut out skill, but how it sounds and feels comes from the instrument setup - which contains information about what overtones are to be created when playing this note.

For a physicist, the pure sound is encoded as just the notion of 440 Hz and some amplitude to balance how loud it is. With those instructions, he'd be able to set up a device that has these creates a 440 Hz tune. To generate the sound and feel of an instrument, the encoding for a physicist would need to contain all the overtones that are to swing with this one sound.

History of sound recording

Let's look at the very first way of recording sound: The Phonautograph of 1857 used a piece of paper or a sheet of glass blackened and then a membrane move a needle. When the plate would be moved, the needle left a written path. The encoding was done via 2 factors: the setup of the stylus (mainly how long is the arm) and the speed of the movement of the plate. Changing either changed the encoding. A longer arm would record a larger amplitude (making fainter sounds recordable) while faster movement would alter the timescale recorded, allowing to look at short instances and better compare them.

These vibration-pattern records could be used to measure and compare sounds but not be used to recreate the sound, as lines on paper nor scratches in soot are a good way to keep a reading needle in boundaries. it took till 2000 and the use of scanners as well as digital processing to recreate these recorded sounds.

The solution to recreate sounds was found by the Edison Laps in 1877 with the phonograph, which used a piece of thick tinfoil to record the motion pattern of the membrane. Again, then encoding was done via the arm setup and the speed at which the tinfoil clad cylinder moved (or rather rotated). It would till the 1880s develop to a wax cylinder, which was easier to inscribe and reproduce from. One such machine was used by Carl Orff.

The first Gramophone came in 1889, mainly altering the shape of the recording medium from cylinders to the well-known shape of vinyl records but made from hard plastics and shellac. Around 1901, a 12-inch gramophone disk held only a 4 minutes track, speaking volumes about the problems of encoding the complex patterns of sound onto a disk. At the same time, an Edison Amberol Cylinder held 4 minutes 30 seconds but would spin at 160 rpm. Soon after, celluloid would become the recording medium of its time, and the disk the de-facto "standard" as it was much better storable.

In 1925 finally, a real standard was developed to record at around $78^{+0.26}_{-0.08}$ rpm, which lead to only a 0.34 rpm difference between areas of 60 or 50 Hz mains voltage (though they needed different encoder rings), making records interchangeable between both machine types. All these recordings were encoded naturally: the vibrations of the membrane in the recording tool would be 1:1 transmitted to the vibrating stylus that would then do the encoding in such a way that a machine would reproduce what the recording one "heard" quite accurately.

When Vinyl came to the playing field as a recording medium at the end of world war II, so came a swap in the reading needle type: instead of a needle that would agitate a membrane directily, sapphire needles that would agitate an electrical pickup which in turn would activate a speaker. But while the recording technology advanced, the track length of a 12-inch disk was still limited to about 4 minutes at 78 rpm. It would only reach more than this in the last years of its use by applying LP technologies to pack the track tighter in the 1950s, achieving 17 minutes.

1948 came the LP, what we know as a classic vinyl record. At its introduction it could cram 23 minutes onto one side, making this possible by only using 33.5 rpm as the recording speed and thinner, much tighter coiled groves, increasing the information density by a factor of 5.75 for a 12-inch disk. 7-inch 45 rpm "singles" came out 4 years later. Within 10 years, the 33.5 and 45 rpm encoded variants had almost completely replaced the 78 rpm market.

Vinyl

As the history of analogous recordings shows, encoding a sound signal is rather easy in theory, hard in practice. A typical 12-inch LP Vinyl record of 20 minutes is a grove that is 427 meters long and coiled up 667 times. That means a single groove is between 0.04 and 0.08 mm wide - with an equally thin wall between. That means, that to achieve a printed phonograph record, you'd have to print accurately down to 40 microns to get an empty track. However, we also need to add the signal atop. And here comes the real problem:

An empty track has some 22 µm deviations, which the needle will usually not pick up at all. Dust, which creates the crackling at times, is in the same area (1-100 µm). The actual sound signal is encoded to have features as small as 75 nanometers. That is 3 magnitudes lower than the mere geometry of the grove, and equally much lower than any printer - including SLS - can achieve today, as 50 µm is often considered a lower limit in 2019.

To show how much tiny defects would ruin the sound quality, look at this rapid cast of a vinyl record. The resolution of the negative and the subsequently cast record is good enough to recognize the music, but the resin cast did contain so many gas bubbles that the noise level of the copy is very high.

Bonus: Unlike on cylinders the encoding of the signal on disks changes from the start to the end! The vinyl spins at a constant rate, but the radius from the center changes, leading in the speed on any part of the grove to be different as $|v|=|\omega \vec r \sin(\theta)|$, where omega is the speed in rad per second, theta is the angle of the reeding, so in this case, the sinus term becomes 1 and vanishes. This factor has to be taken into account for encoding so the pitch of the record doesn't change if the record is not created naturally by inscribing the signal onto a spinning disk.

Other encoding

Rumble Strips

However, it is quite easy to create a structure that creates sounds based on interaction with another body. Highway Sound Strips create sounds as the car tire bumps up and down, turning the car and tires into resonance bodies while the street "beats" upon it. In the case of a large percussion instrument like a car, we are talking centimeter scale.

Peg-Cylinder

A very simple method would be to go back to encoding and check out the note notation but limiting the length of notes to one unit. Encoding music this way results in pegs or ridges on a cylinder, which then can be used to actuate a mechanism to decode the music and create sounds like in a music box. In a music box of this kind, the demand for accuracy is about 3 to 5 magnitudes lower than in vinyl records: we speak about a tenth of a millimeter to centimeter scale.

Such a Musical box or noisemaker can be easily printed and is pretty much a rumble strip coiled around a cylinder. The length of the sample is determined by the resolution, playback speed and diameter of the cylinder while the complexity is determined by the rows of pegs of it: a noisemaker is pretty much a 1-note, high speed, music box. Typically, one rotation stores about 25 to 30 seconds. Typical examples would be the first part of Für Elise, or the Marble Machine (Between second 30 and 35 the encoding wheel rotates 1 fifth). Some barrel organs also use the peg method, like one can see here. With some trickery, one cylinder could be used to encode multiple parts that play one after another once a rotation is done by and silencing some parts of the machine depending on an extra encoder, like this 3-part Für Elise music box.

Hole-Plate(-strip)

A different method would be to encode the music as holes in a continuous strip and use air as a decoding method. If the air then gets directed into pipes, we have a street organ. Typically, one would use a paper strip as the encoded message, but it could be printed just as well, especially if one uses a setup that uses plates hinged to one another instead of a rolled-up paper as in this example. With such a way to stash away the extra length, the upper limit for music length rises from a couple of seconds to several minutes easily even with such a "bad" encoding.

@Trish Thanks. I know a little bit about sound. I'm talking about taking an actual recorded sound, any sound, prerecorded and printing out the wave lengths to make a physical conversion. I know it would be a very short sound. I'm thinking trenches. — 1.21 gigawatts, Oct 12 '19 at 01:08
@1.21gigawatts recorded sound itself is not printable. What you see on your sceen is not *sound* it is the graphical representation of the mathematic analysis of the physic measurement made by the microphone. It is a representation of the instructions to a physicist "mix these vibrational patterns to create this sound". You can print a device that *creates* vibrational patterns - like a music box - or a *mathematical representation* - like a graph of the fourier analysis - but you can't **print** a vibrational pattern itself. — Trish, Oct 12 '19 at 14:53
@1.21gigawatts added more about the limitations of peg encoded and found a way out to get pretty much an arbitrarily long and complex piece of music provided one has the fitting decoder machine. — Trish, Oct 15 '19 at 19:17

N. Virgo · Answer 2 · 2019-10-13T18:55:43.680

I think this is just about doable. In this answer, I will assume you want to produce a "rumble strip" style of object that will reproduce a recording of human speech. I'll assume you don't care about sound quality, you just want the words to be intelligible.

The main things to consider are the printer's resolution, the size of the object to be printed, and the sample rate. Together, these factors determine the length of the sound, and the rate at which you need to move along it to reproduce the sound.

Let's start with sample rate. A CD has a sample rate of 44100 samples per second (Hz), but that might be a bit ambitious. Telephones use a lower sample rate of 8000, and it says here that speech is still intelligible at a sample rate of 2500 Hz. Let's go with this rate.

Now let's consider the resolution of the printer. A typical nozzle size is 0.2mm, which probably limits the resolution to around that size, though you can probably do better with some care, and I imagine people in this community will be able to help with that. I am guessing that you would want to print the object horizontally, so you're dealing with xy resolution instead of z resolution. (Note that resin 3d printers have much better resolutions, so they might be ideal for this task, despite their smaller print volumes.) Let's start by assuming 0.2mm is our resolution, since this should be easy to achieve with any printer.

This means that every sample in the sound file takes up about 0.2mm. Let's say we have one second of speech - that's long enough to say "Hello!", for example - at 2500 Hz. That means we have 2500 samples. 2500 * 0.2mm = 500mm, so your rumble strip will be about 1/2 meter long. That's unlikely to fit on your print bed, but you can print it in sections and stick them together - you can probably print them all at the same time. You could even curl it round into a spiral, making it even more like a vinyl record.

Then all you have to do is take a rigid object like a guitar pick and slide it along the strip at the right speed, so that it takes about 1 second. Then you should hear the sound played back. Attaching a resonator to the pick or the strip should increase the volume.

Increasing the resolution will decrease the length of the strip, or allow you to play a longer sound for the same length of strip, or increase the sample rate. E.g. if you can get a resolution of 0.1mm then you could play a 2 second sound instead, using the same 0.5m length of rumble strip.

In principle, creating the object is not hard, but I don't know any software that can do it out of the box. You just need to make the surface height correspond to the waveform. If I was doing this I would probably write a Python script to turn the wave file into a list of numbers, then paste those into in OpenSCAD's polygon function, which I would then extrude to make the object. But others might know an easier way.

This is just beautiful! I was playing with the idea to generate gcode, but openscad sounds like a great idea. I guess that some simple 8-bit tune (commander Keen, super Mario) could be encoded in even less samples per second and a second of "music" might fit on a 20cm. strip. — Hacky, Oct 21 '19 at 08:59

score 2 · Answer 3 · answered Oct 15 '19 at 17:47

Here's an alternative which takes advantage of the relatively (!!) high-precision layer capability of the 3D printer: Make a lithopane strip and use an optical sensor to reproduce the sound.

This is (was) done to encode the soundtrack for movies alongside the image frames in the film strip (reel). Basically the thickness of the print at a given location modulates the optical throughput and thus the signal strength out of the photodetector.

Note that, as with movie reels, you will need a lot of real estate to record a decent amount of audio.