Timeline component

slavagoreev

Hey, guys! I really enjoyed playing around with your system. As a developer, I'm planning to build a simple audio recorder app, and I was wondering if you could suggest how to implement a timeline like yours. To make things simpler, I want to ask you some questions:

Did you use any existing solutions? Could you please share some of the open-source libraries/tutorials/apps that inspired you to build it this way? AFAIK it is very similar to what iMovie or Final Cut Pro does, but those are native apps, so they could serve as a reference but nothing more.
How can I properly implement resizing a cut section so that it is extended with the original soundwave that was cut off before? Do you have any suggestions?
Imagine a single soundwave that was cut into pieces and rearranged. Do you implement an optimization where all of those nodes reference the original track and just create an audio node from X to Y milliseconds? This way you could build a histogram for the original track and just slice it into pieces for each segment. Does this work for you, and if not, why not?
Do you use the Web Audio API? Do you use some libraries on top of it, or did you write everything from scratch?

j_w

hey slavagoreev

Glad you like it, it's pretty much custom from the ground up. 🙂

It's easier to imagine clips on the AudioNodes timeline as views on their associated data. Much like a Float32Array is a (partial or full) view of an ArrayBuffer.
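To make the analogy concrete, here is a minimal TypeScript sketch of a partial Float32Array view over a shared ArrayBuffer. The sample values are made up for illustration; this only shows the typed-array behavior the comparison relies on, not anything from AudioNodes itself.

```typescript
// One underlying buffer holding 8 float32 samples (4 bytes each).
const buffer = new ArrayBuffer(8 * Float32Array.BYTES_PER_ELEMENT);
const whole = new Float32Array(buffer);
whole.set([0, 0.25, 0.5, 0.75, 1, 0.75, 0.5, 0.25]);

// A partial view: starts at sample 2 and spans 3 samples. No data is copied.
const part = new Float32Array(buffer, 2 * Float32Array.BYTES_PER_ELEMENT, 3);

// Writing through the partial view changes the shared data,
// so the full view sees the new value at index 2.
part[0] = -1;
```

A clip behaves the same way in this picture: it describes which part of the data is visible, and the data itself is never cut.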

Simply put, each clip in AudioNodes can be represented by 4 simple data fields: begin time, end time, offset, and position. That's it. Dragging the entire clip to the right changes the offset. Trimming from the left changes the begin time. The actual on-screen position is the sum of offset and begin time.
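A rough TypeScript sketch of that model, under a few assumptions: the names `Clip`, `position`, `drag`, and `trimLeft` are hypothetical and not the AudioNodes API, times are in seconds, and position is treated as a value derived from offset and begin time rather than stored.

```typescript
// Hypothetical clip shape; field names are illustrative only.
interface Clip {
  begin: number;  // where the visible part starts within the source data
  end: number;    // where the visible part ends within the source data
  offset: number; // shift of the source data along the timeline
}

// On-screen position of the clip's left edge: offset + begin time.
function position(clip: Clip): number {
  return clip.offset + clip.begin;
}

// Dragging the entire clip changes only the offset.
function drag(clip: Clip, delta: number): Clip {
  return { ...clip, offset: clip.offset + delta };
}

// Trimming from the left changes only the begin time (clamped to [0, end]).
// The source data is untouched, so a negative delta reveals the
// previously trimmed audio again.
function trimLeft(clip: Clip, delta: number): Clip {
  const begin = Math.min(Math.max(clip.begin + delta, 0), clip.end);
  return { ...clip, begin };
}
```

With this shape, extending a previously trimmed clip is just `trimLeft` with a negative delta: since the clip is only a view, the original waveform is still there to show.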

Some Nodes have clips with additional fields, e.g. the Audio File Node has per-clip playback rate and mode, but that's not part of the base framework.

And then finally, Nodes know about these fields, and can render their clips on the timeline, e.g. with an offset to the rendered waveform.
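One way to picture rendering with an offset, as a sketch only: assume the waveform was summarized once for the whole source file as interleaved [min, max] pairs, and each clip draws just the range that falls between its begin and end times. The function name `peaksForClip`, the interleaved layout, and the `bucketsPerSecond` parameter are all assumptions for illustration.

```typescript
// Assumed layout: `peaks` holds [min, max] pairs for the whole source file,
// with `bucketsPerSecond` pairs per second of audio.
function peaksForClip(
  peaks: Float32Array,
  bucketsPerSecond: number,
  begin: number, // clip begin time in seconds, within the source data
  end: number,   // clip end time in seconds, within the source data
): Float32Array {
  const bucketCount = Math.floor(peaks.length / 2);
  const first = Math.min(Math.max(Math.floor(begin * bucketsPerSecond), 0), bucketCount);
  const last = Math.min(Math.max(Math.ceil(end * bucketsPerSecond), first), bucketCount);
  // subarray returns a view on the same memory; nothing is copied.
  return peaks.subarray(first * 2, last * 2);
}
```

Every clip cut from the same file can then share one set of precomputed data and only differ in which range it draws.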

You get the idea.

And yes BTW, it's all based on the Web Audio API under the hood, with quite a few additions.

slavagoreev

Yeah, thank you for the detailed explanation. As I asked, do you know of any state-of-the-art projects implementing the same timeline behavior? I am thinking of combining something like react-dnd + a resizing library + waveform rendering.

j_w

slavagoreev Nothing out of the box that I know of.

There are quite a few options out there, always have been, but in something like a more complex DAW, you'll want to render dozens of waveforms at once at 60 fps (zooming, scrolling, etc.), and also not lose precision when zoomed out.

So you can't just take every nth sample from the raw audio data, or it'll look unintelligible when zoomed out. This in turn requires you to pre-compute the accurate waveform "shape", because you can't construct it from the raw audio data on every single render, per clip, when you have 20 of them. Especially with longer audio files.
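A common way to pre-compute such a shape is to keep the minimum and maximum sample of each fixed-size bucket. The sketch below assumes that technique; it is not necessarily what AudioNodes does, and the name `computePeaks` and the interleaved [min, max] layout are made up for illustration.

```typescript
// Summarize mono samples as one [min, max] pair per bucket.
// Output layout: [min0, max0, min1, max1, ...].
function computePeaks(samples: Float32Array, samplesPerBucket: number): Float32Array {
  if (!Number.isInteger(samplesPerBucket) || samplesPerBucket < 1) {
    throw new RangeError("samplesPerBucket must be a positive integer");
  }
  const bucketCount = Math.ceil(samples.length / samplesPerBucket);
  const peaks = new Float32Array(bucketCount * 2);
  for (let b = 0; b < bucketCount; b++) {
    const start = b * samplesPerBucket;
    const stop = Math.min(start + samplesPerBucket, samples.length);
    let min = Infinity;
    let max = -Infinity;
    for (let i = start; i < stop; i++) {
      const s = samples[i];
      if (s < min) min = s;
      if (s > max) max = s;
    }
    peaks[2 * b] = min;
    peaks[2 * b + 1] = max;
  }
  return peaks;
}

// Every 2nd sample of this signal is 0, 0, 0.5, so naive decimation
// would miss both the +1 and the -1 spike. The min/max pairs keep them.
const demo = new Float32Array([0, 1, 0, -1, 0.5, -0.5]);
const demoPeaks = computePeaks(demo, 2);
```

The result only needs recomputing when the file changes, and coarser zoom levels can be derived from it by merging neighboring pairs.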

And that in turn is something you'll quickly realize you need to be doing on a worker thread (or rather, a few of them in parallel), and... yeah, at that point it probably becomes a lot easier to engineer your own approach than to take something ready-made and try to make it performant enough.

And then you may eventually want to reduce memory use by streaming files, to avoid loading 1-hour-long audio files into memory (especially with the Web Audio API's float32 representation of audio buffers).
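For a sense of scale, here is a back-of-the-envelope calculation. The 48 kHz sample rate and stereo channel count are assumptions picked for illustration, and `decodedSizeBytes` is a hypothetical helper.

```typescript
// Size of fully decoded audio held as float32 samples (4 bytes each).
function decodedSizeBytes(durationSeconds: number, sampleRate: number, channels: number): number {
  return durationSeconds * sampleRate * channels * Float32Array.BYTES_PER_ELEMENT;
}

// One hour of stereo audio at 48 kHz:
// 3600 s * 48000 samples/s * 2 channels * 4 bytes = 1,382,400,000 bytes.
const oneHourBytes = decodedSizeBytes(3600, 48000, 2);
const oneHourGB = oneHourBytes / 1e9; // roughly 1.38 GB
```

That is for a single file, regardless of how small the compressed file on disk was.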

So... yeah, I guess it depends on your requirements, but there's a good chance you'll have to roll your own unless you are doing relatively simple things.

slavagoreev

Thanks a lot. Hopefully I won't make any mistakes with this knowledge.

j_w

slavagoreev np at all, shoot if you have any more questions!