V2EX › Projection 的所有回复 › 第 1 页 / 共 4 页

V2EX = way to explore

V2EX 是一个关于分享和探索的地方

现在注册

已注册用户请登录

1 2 3 4

❮

❯

75 天前

回复了 teli 创建的主题 › Python › Python 升序、查找高效的集合方案？

创建一个类并自定义 __contains__ 魔术方法，实现则使用 bisect 二分查找。根据这个思路找 GPT 生成了代码：

import bisect

class OrderedList:
def __init__(self, items):
# 确保列表有序
self.items = sorted(items)

# 自定义 in 操作，使用二分查找
def __contains__(self, item):
# 使用 bisect 模块进行二分查找
index = bisect.bisect_left(self.items, item)
# 检查查找到的索引是否在范围内，且对应元素是否与目标相等
return index < len(self.items) and self.items[index] == item

# 支持遍历
def __iter__(self):
return iter(self.items)

# 使用示例
ordered_list = OrderedList([10, 1, 7, 3, 5])

# 遍历
for item in ordered_list:
print(item) # 输出: 1 3 5 7 10 (有序)

# 使用自定义的 in 操作（使用二分查找）
print(7 in ordered_list) # 输出: True
print(6 in ordered_list) # 输出: False

84 天前

回复了 iqoo 创建的主题 › 程序员 › 分享一个空间利用率超高的 Base36 算法

原来是分组的 Base36 ，我还以为是常规的 Base36 ，然后优化内存使用

84 天前

回复了 iqoo 创建的主题 › 程序员 › 分享一个空间利用率超高的 Base36 算法

我想明白你怎么算的了：

```python
import math

for m in range(1, 16):
n = math.ceil(math.log(256 ** m, 36))
ratio = m / n
print(f'{m=} bytes, {n=} chars, {ratio=:.2%}')
```

m=1 bytes, n=2 chars, ratio=50.00%
m=2 bytes, n=4 chars, ratio=50.00%
m=3 bytes, n=5 chars, ratio=60.00%
m=4 bytes, n=7 chars, ratio=57.14%
m=5 bytes, n=8 chars, ratio=62.50%
m=6 bytes, n=10 chars, ratio=60.00%
m=7 bytes, n=11 chars, ratio=63.64%
m=8 bytes, n=13 chars, ratio=61.54%
m=9 bytes, n=14 chars, ratio=64.29%
m=10 bytes, n=16 chars, ratio=62.50%
m=11 bytes, n=18 chars, ratio=61.11%
m=12 bytes, n=19 chars, ratio=63.16%
m=13 bytes, n=21 chars, ratio=61.90%
m=14 bytes, n=22 chars, ratio=63.64%
m=15 bytes, n=24 chars, ratio=62.50%

84 天前

回复了 iqoo 创建的主题 › 程序员 › 分享一个空间利用率超高的 Base36 算法

@iqoo 任何程序编码出来结果都是一样的，和空间利用率有什么关系

84 天前

回复了 iqoo 创建的主题 › 程序员 › 分享一个空间利用率超高的 Base36 算法

没有搞懂你说的空间利用率是什么意思。

假设我每次只读取一个字节的数据而不是 9 字节，那么刚开始会产生一个字符，照你的说法空间利用率不就是 100% 了？

每次读取 9 字节只能保证至少产生 13 个字符（比如刚开始时），有时才会产生 14 个字符。当产生 13 个字符时，你说的空间利用率就是 9 / 13 = 0.69 了。

>>> 36 ** 13 < 256 ** 9 < 36 ** 14
True

你可能是想要找到一个缓存区的大小 m ，满足 `256 ** m >= 36 ** n` 且左右两边占用 bits 差值足够小，但是概念上没有理清。我倒是觉得应该这样算：

```python
import math

base = 36

for m in range(1, 16):
n = int(math.log(256 ** m, 36))
waste = m * 8 - math.log2(36 ** n)
print(f'{m=} bytes, {n=} chars, {waste=} bits')
```

m=1 bytes, n=1 chars, waste=2.830074998557688 bits
m=2 bytes, n=3 chars, waste=0.49022499567306355 bits
m=3 bytes, n=4 chars, waste=3.3202999942307514 bits
m=4 bytes, n=6 chars, waste=0.9804499913461271 bits
m=5 bytes, n=7 chars, waste=3.8105249899038114 bits
m=6 bytes, n=9 chars, waste=1.470674987019187 bits
m=7 bytes, n=10 chars, waste=4.3007499855768785 bits
m=8 bytes, n=12 chars, waste=1.9608999826922542 bits
m=9 bytes, n=13 chars, waste=4.790974981249946 bits
m=10 bytes, n=15 chars, waste=2.451124978365314 bits
m=11 bytes, n=17 chars, waste=0.11127497548069698 bits
m=12 bytes, n=18 chars, waste=2.941349974038374 bits
m=13 bytes, n=20 chars, waste=0.601499971153757 bits
m=14 bytes, n=21 chars, waste=3.431574969711434 bits
m=15 bytes, n=23 chars, waste=1.091724966826817 bits

很明显 9 字节的空间利用率非常低。即便如此浪费的也是几个 bits 而已，在字节这个粒度下感觉空间利用率就是个伪需求。

86 天前

回复了 Asuler 创建的主题 › 程序员 › 程序员菜鸟，请教一个 web 视频转码的问题

https://chromium.woolyss.com/ 有包含 H.265 解码器的 Chromium 下载
为哈非得在浏览器播放，本地随便起个播放器不行吗

128 天前

回复了 lsk569937453 创建的主题 › 程序员 › 有个前端并发数的问题

requestDelay() 前面要加 await 才行，不然你这样就是开启 10 个请求，后面 requestInstant() 只能排队

129 天前

回复了 nbin2008 创建的主题 › JavaScript › 请教大家一个问题， js，异步执行

你说的这个用 Streams API 很容易实现：

```js
import { setTimeout } from "node:timers/promises";

let id = 0;

async function fetchData() {
console.log("fetchData()");
await setTimeout(Math.random() * 500);
return Array(10)
.fill(null)
.map(() => ({ id: ++id }));
}

const stream = new ReadableStream(
{
async pull(controller) {
const data = await fetchData();
for (const a of data) {
controller.enqueue(a);
}
},
},
new CountQueuingStrategy({ highWaterMark: 0 }), // fully lazy
).values();

async function getD() {
return (await stream.next()).value;
}

for (let i = 0; i < 30; i++) {
console.log("data", await getD());
}
```

输出结果：

fetchData()
data { id: 1 }
data { id: 2 }
data { id: 3 }
data { id: 4 }
data { id: 5 }
data { id: 6 }
data { id: 7 }
data { id: 8 }
data { id: 9 }
data { id: 10 }
fetchData()
data { id: 11 }
data { id: 12 }
data { id: 13 }
data { id: 14 }
data { id: 15 }
data { id: 16 }
data { id: 17 }
data { id: 18 }
data { id: 19 }
data { id: 20 }
fetchData()
data { id: 21 }
data { id: 22 }
data { id: 23 }
data { id: 24 }
data { id: 25 }
data { id: 26 }
data { id: 27 }
data { id: 28 }
data { id: 29 }
data { id: 30 }

这里为了定义你所需要 getD() 显得把代码搞复杂了一点，而实际上你可以根据情况使用 for await... of 。

你从 LLM 获得的代码有轮询的部分，所以强烈不推荐：

await new Promise(resolve => setTimeout(resolve, 100)); // 等待数据获取完成

140 天前

回复了 dvsilch 创建的主题 › JavaScript › JS 中，串行异步任务的取消是否有更好的处理方式

我用生成器简单实现了一下，代码 https://pastebin.com/hi04KbPd

function delay(ms) {
return new Promise((resolve) => setTimeout(resolve, ms));
}

const then = Date.now();

async function task(name, input) {
console.log(`[${Date.now() - then}] ${name}: start, input: ${input}`);
await delay(1_000);
console.log(`[${Date.now() - then}] ${name}: done, input: ${input}`);
return input + name;
}

let token = 0;
async function onUpdate(input) {
let current = ++token;
async function* gen() {
let result = await task('a', input);
yield;
result = await task('b', result);
yield;
result = await task('c', result);
console.log(`[${Date.now() - then}] final result: ${result}`);
return result;
}
for await (const _ of gen()) {
if (token !== current) break;
}
}

onUpdate('1');
await delay(500);
onUpdate('2');
await delay(2_000);
onUpdate('3');

输出结果：

[0] a: start, input: 1
[510] a: start, input: 2
[1006] a: done, input: 1
[1526] a: done, input: 2
[1527] b: start, input: 2a
[2516] a: start, input: 3
[2533] b: done, input: 2a
[3520] a: done, input: 3
[3520] b: start, input: 3a
[4528] b: done, input: 3a
[4530] c: start, input: 3ab
[5536] c: done, input: 3ab
[5537] final result: 3abc

140 天前

回复了 dvsilch 创建的主题 › JavaScript › JS 中，串行异步任务的取消是否有更好的处理方式

@rabbbit 用生成器确实是一个好办法，在可被取消的时间点添加 yield 。暂停后只要不调用 next() 方法就不会继续执行，这样就需要自己写一个执行器，调度的时机可以自己灵活控制，像 co 那种执行器是自动执行的，async 函数也是自动执行的。

140 天前

回复了 dvsilch 创建的主题 › JavaScript › JS 中，串行异步任务的取消是否有更好的处理方式

给输出结果加个时间线：

[1] Starting task A with input: 1
[514] Starting task A with input: 2
[1010] Finishing task A with input: 1
[1537] Finishing task A with input: 2
[1539] Starting task B with input: 2A
[2019] Starting task A with input: 3
[2542] Finishing task B with input: 2A
[3023] Finishing task A with input: 3
[3024] Starting task B with input: 3A
[4027] Finishing task B with input: 3A
[4028] Starting task C with input: 3AB
[5038] Finishing task C with input: 3AB
[5040] Final result: 3ABC

140 天前

回复了 dvsilch 创建的主题 › JavaScript › JS 中，串行异步任务的取消是否有更好的处理方式

用 RxJS 可以方便地实现，跟 AI 交流了一会后给出了如下代码：

import { Subject, from } from 'rxjs';
import { switchMap } from 'rxjs/operators';

// 模拟异步任务 A 、B 、C
const taskA = (input) => {
console.log(`Starting task A with input: ${input}`);
return new Promise((resolve) => {
setTimeout(() => {
console.log(`Finishing task A with input: ${input}`);
resolve(input + 'A');
}, 1000); // 假设任务 A 需要 1 秒钟
});
};

const taskB = (input) => {
console.log(`Starting task B with input: ${input}`);
return new Promise((resolve) => {
setTimeout(() => {
console.log(`Finishing task B with input: ${input}`);
resolve(input + 'B');
}, 1000); // 假设任务 B 需要 1 秒钟
});
};

const taskC = (input) => {
console.log(`Starting task C with input: ${input}`);
return new Promise((resolve) => {
setTimeout(() => {
console.log(`Finishing task C with input: ${input}`);
resolve(input + 'C');
}, 1000); // 假设任务 C 需要 1 秒钟
});
};

const subject = new Subject();

subject
.pipe(
switchMap((value) =>
from(taskA(value)).pipe(
switchMap((resultA) => from(taskB(resultA))),
switchMap((resultB) => from(taskC(resultB))),
),
),
)
.subscribe((result) => {
console.log('Final result:', result);
});

// 发出一些值
subject.next('1'); // 发出第一个值
setTimeout(() => subject.next('2'), 500); // 在 0.5 秒后发出第二个值，中止第一个任务并开始新的任务
setTimeout(() => subject.next('3'), 2000); // 在 2 秒后发出第三个值

输出结果：

Starting task A with input: 1
Starting task A with input: 2
Finishing task A with input: 1
Finishing task A with input: 2
Starting task B with input: 2A
Starting task A with input: 3
Finishing task B with input: 2A
Finishing task A with input: 3
Starting task B with input: 3A
Finishing task B with input: 3A
Starting task C with input: 3AB
Finishing task C with input: 3AB
Final result: 3ABC

实际上用全套 RxJS API 更简单，无奈 API 忘了好多

267 天前

回复了 nnegier 创建的主题 › 程序员 › 你们的数据库开放外网 IP 链接吗，我想用 DataGrip

除了 VPN 、SSH 隧道，mTLS 也可以，甚至可以把这三个组合在一起用。

298 天前

回复了 bclerdx 创建的主题 › Windows › 关于 PC 版本的应用程序或软件是 32 位还是 64 位的区别？

一般来说，安装程序不会写死 "Program Files" 或 "Program Files (x86)" 这个路径，而是直接从系统环境变量中读取 "ProgramFiles"。如果程序编译目标目标平台为 x86 那么获得到的环境变量可能为 "Program Files" 或 "Program Files (x86)"（取决于操作系统的位数）；如果目标为 x64 ，那么只可能获得 "Program Files"（ 32 位操作系统无法直接运行 64 位程序）。

当然不排除个别程序有写死安装路径的可能。
你可以自己写个程序来验证。

311 天前

回复了 Inzufu 创建的主题 › 音乐 › 感觉网易云的音质要好于 Spotify？

@Satelli
Tidal 早就逐步开始逐步替换 MQA 为 Hi-Res FLAC 了，现在新歌我看到的都是 Hi-Res FLAC ，讨论 MQA 音质如何没有太大意义了。一个简单的判断方法是在浏览器中打开网页播放器，如果有 MAX 标识，那么就是了。

313 天前

回复了 wadejs 创建的主题 › 前端开发 › 撸了个命令行工具，快速用 vscode 打开指定目录

只要用 VS Code 打开过一次，以后就可以启动 VS Code ，然后 Ctrl +R 打开最近使用的目录或文件

317 天前

回复了 Gdao3386 创建的主题 › 程序员 › 求助，有没有免费或者不需要企业认证的电话语音通知

@vilic 饭碗警告 mentioned

353 天前

回复了 Steps 创建的主题 › 问与答 › 前几天在 twitter 看到一条关于腾讯的代码，可以获取到真实的访问地址。

@2218675712 是的，所以最好不要使用网络层面的分流。网络层面的分流无法判断这个请求来自哪个应用（或哪个 tab ），“分流”最好在应用上做，比如开多个浏览器（或虚拟机）：一个只访问国内网站不用代理，另一个只用代理但绝不登录国内网站。

如果某个国内网站真心想要探测代理（开启分流时），完全可以搞很多境外服务器，随机域名、随机 IP ，这样无论啥分流规则都没有用。分流规则一定是不完善且有滞后性的。

1 2 3 4

❮

❯