Пентадекатлон мечты : Компьютерные сети и Web-технологии - Страница 181 fixfix
2014 dxdy logo

Научный форум dxdy

Математика, Физика, Computer Science, Machine Learning, LaTeX, Механика и Техника, Химия,
Биология и Медицина, Экономика и Финансовая Математика, Гуманитарные науки




Начать новую тему Ответить на тему На страницу Пред.  1 ... 178, 179, 180, 181, 182, 183, 184 ... 215  След.
 
 Re: Пентадекатлон мечты
Сообщение28.11.2022, 14:03 
Аватара пользователя


29/04/13
8365
Богородский
EUgeneUS в сообщении #1571742 писал(а):
Если оценивать машино-часы, а не штуки, то до экватора,

Ну так я же неслучайно написал

Yadryara в сообщении #1571740 писал(а):
Приближаемся к своеобразному экватору.


EUgeneUS в сообщении #1571742 писал(а):
По моим оценкам только-только перевалили 10%.

Что-то я подозреваю, что эта оценка основана на предположении, что меньшую 11-ку найти так и не удастся. Нынешний рекорд вроде бы по-прежнему 9.89e21.

 Профиль  
                  
 
 Re: Пентадекатлон мечты
Сообщение29.11.2022, 02:34 
Заслуженный участник


20/08/14
11886
Россия, Москва
Yadryara
Добил и следующий LCM:

(Оффтоп)


 Профиль  
                  
 
 Re: Пентадекатлон мечты
Сообщение29.11.2022, 10:11 
Аватара пользователя


29/04/13
8365
Богородский
Объединённая таблица по данным на сегодня.

$\tikz[scale=.08]{
\fill[green!90!blue!50] (110,160) rectangle (125,210);
\fill[green!90!blue!50] (85,170) rectangle (95,180);
\fill[green!90!blue!50] (0,140) rectangle (140,170);
\fill[green!70!blue!80] (0,120) rectangle (140,140);
\draw  (0,210) rectangle  (10,220);
\draw  (10,210) rectangle  (45,220);
\draw  (45,210) rectangle  (55,220);
\draw  (55,210) rectangle  (65,220);
\draw  (65,210) rectangle  (75,220);
\draw  (75,210) rectangle  (85,220);
\draw  (85,210) rectangle  (95,220);
\draw  (95,210) rectangle  (110,220);
\draw  (110,210) rectangle  (125,220);
\draw  (125,210) rectangle  (140,220);
\draw  (0,200) rectangle  (10,210);
\draw  (10,200) rectangle  (45,210);
\draw  (45,200) rectangle  (55,210);
\draw  (55,200) rectangle  (65,210);
\draw  (65,200) rectangle  (75,210);
\draw  (75,200) rectangle  (85,210);
\draw  (85,200) rectangle  (95,210);
\draw  (95,200) rectangle  (110,210);
\draw  (110,200) rectangle  (125,210);
\draw  (125,200) rectangle  (140,210);
\draw  (0,190) rectangle  (10,200);
\draw  (10,190) rectangle  (45,200);
\draw  (45,190) rectangle  (55,200);
\draw  (55,190) rectangle  (65,200);
\draw  (65,190) rectangle  (75,200);
\draw  (75,190) rectangle  (85,200);
\draw  (85,190) rectangle  (95,200);
\draw  (95,190) rectangle  (110,200);
\draw  (110,190) rectangle  (125,200);
\draw  (125,190) rectangle  (140,200);
\draw  (0,180) rectangle  (10,190);
\draw  (10,180) rectangle  (45,190);
\draw  (45,180) rectangle  (55,190);
\draw  (55,180) rectangle  (65,190);
\draw  (65,180) rectangle  (75,190);
\draw  (75,180) rectangle  (85,190);
\draw  (85,180) rectangle  (95,190);
\draw  (95,180) rectangle  (110,190);
\draw  (110,180) rectangle  (125,190);
\draw  (125,180) rectangle  (140,190);
\draw  (0,170) rectangle  (10,180);
\draw  (10,170) rectangle  (45,180);
\draw  (45,170) rectangle  (55,180);
\draw  (55,170) rectangle  (65,180);
\draw  (65,170) rectangle  (75,180);
\draw  (75,170) rectangle  (85,180);
\draw  (85,170) rectangle  (95,180);
\draw  (95,170) rectangle  (110,180);
\draw  (110,170) rectangle  (125,180);
\draw  (125,170) rectangle  (140,180);
\draw  (0,160) rectangle  (10,170);
\draw  (10,160) rectangle  (45,170);
\draw  (45,160) rectangle  (55,170);
\draw  (55,160) rectangle  (65,170);
\draw  (65,160) rectangle  (75,170);
\draw  (75,160) rectangle  (85,170);
\draw  (85,160) rectangle  (95,170);
\draw  (95,160) rectangle  (110,170);
\draw  (110,160) rectangle  (125,170);
\draw  (125,160) rectangle  (140,170);
\draw  (0,150) rectangle  (10,160);
\draw  (10,150) rectangle  (45,160);
\draw  (45,150) rectangle  (55,160);
\draw  (55,150) rectangle  (65,160);
\draw  (65,150) rectangle  (75,160);
\draw  (75,150) rectangle  (85,160);
\draw  (85,150) rectangle  (95,160);
\draw  (95,150) rectangle  (110,160);
\draw  (110,150) rectangle  (125,160);
\draw  (125,150) rectangle  (140,160);
\draw  (0,140) rectangle  (10,150);
\draw  (10,140) rectangle  (45,150);
\draw  (45,140) rectangle  (55,150);
\draw  (55,140) rectangle  (65,150);
\draw  (65,140) rectangle  (75,150);
\draw  (75,140) rectangle  (85,150);
\draw  (85,140) rectangle  (95,150);
\draw  (95,140) rectangle  (110,150);
\draw  (110,140) rectangle  (125,150);
\draw  (125,140) rectangle  (140,150);
\draw  (0,130) rectangle  (10,140);
\draw  (10,130) rectangle  (45,140);
\draw  (45,130) rectangle  (55,140);
\draw  (55,130) rectangle  (65,140);
\draw  (65,130) rectangle  (75,140);
\draw  (75,130) rectangle  (85,140);
\draw  (85,130) rectangle  (95,140);
\draw  (95,130) rectangle  (110,140);
\draw  (110,130) rectangle  (125,140);
\draw  (125,130) rectangle  (140,140);
\draw  (0,120) rectangle  (10,130);
\draw  (10,120) rectangle  (45,130);
\draw  (45,120) rectangle  (55,130);
\draw  (55,120) rectangle  (65,130);
\draw  (65,120) rectangle  (75,130);
\draw  (75,120) rectangle  (85,130);
\draw  (85,120) rectangle  (95,130);
\draw  (95,120) rectangle  (110,130);
\draw  (110,120) rectangle  (125,130);
\draw  (125,120) rectangle  (140,130);
\node at (4.7,215){\text{1044}};
\node at (28,215){\text{LCM}};
\node at (50,215){\text{3}};
\node at (60,215){\text{4}};
\node at (70,215){\text{5}};
\node at (80,215){\text{6}};
\node at (90,215){\text{7}};
\node at (103,215){\text{Total}};
\node at (118,215){\text{Done}};
\node at (133,215){\text{Work}};
\node at (5.6,205){\text{1.}};
\node at (36,205){\text{554400}};
\node at (50,205){\text{8}};
\node at (60,205){\text{30}};
\node at (70,205){\text{28}};
\node at (104,205){\text{66}};
\node at (90,205){\text{}};
\node at (118,205){\text{4}};
\node at (133,205){\text{5 ?}};
\node at (5.6,195){\text{2.}};
\node at (35,195){\text{3880800}};
\node at (50,195){\text{8}};
\node at (60,195){\text{46}};
\node at (70,195){\text{60}};
\node at (80,195){\text{28}};
\node at (103,195){\text{142}};
\node at (118,195){\text{26}};
\node at (133,195){\text{9 ?}};
\node at (5.6,185){\text{3.}};
\node at (35,185){\text{6098400}};
\node at (50,185){\text{8}};                       
\node at (60,185){\text{46}};
\node at (70,185){\text{78}};
\node at (80,185){\text{48}};
\node at (103,185){\text{180}};
\node at (118,185){\text{31}};
\node at (133,185){\text{2 ?}};
\node at (5.6,175){\text{4.}};
\node at (34,175){\text{42688800}};
\node at (50,175){\text{8}};
\node at (60,175){\text{54}};
\node at (70,175){\text{94}};
\node at (80,175){\text{60}};
\node at (90,175){\text{10}};
\node at (103,175){\text{226}};
\node at (118,175){\text{56}};
\node at (133,175){\text{7 ?}};
\node at (5.6,165){\text{5.}};
\node at (32,165){\text{1331114400}};
\node at (50,165){\text{}};
\node at (60,165){\text{8}};
\node at (70,165){\text{30}};
\node at (80,165){\text{28}};
\node at (104,165){\text{66}};
\node at (118,165){\text{66}};
\node at (133,165){\text{}};
\node at (5.6,155){\text{6.}};
\node at (32,155){\text{8116970400}};
\node at (50,155){\text{}};
\node at (60,155){\text{8}};
\node at (70,155){\text{26}};
\node at (80,155){\text{24}};
\node at (104,155){\text{58}};
\node at (118,155){\text{58}};
\node at (133,155){\text{}};
\node at (5.6,145){\text{7.}};
\node at (31,145){\text{14642258400}};
\node at (60,145){\text{8}};
\node at (70,145){\text{42}};
\node at (80,145){\text{66}};
\node at (90,145){\text{36}};
\node at (103,145){\text{152}};
\node at (118,145){\text{152}};
\node at (133,145){\text{}};
\node at (5.6,135){\text{8.}};
\node at (31,135){\text{56818792800}};
\node at (60,135){\text{8}};
\node at (70,135){\text{38}};
\node at (80,135){\text{40}};
\node at (90,135){\text{10}};
\node at (104,135){\text{96}};
\node at (118,135){\text{96}};
\node at (5.6,125){\text{9.}};
\node at (28,125){\text{19488845930400}};
\node at (70,125){\text{8}};
\node at (80,125){\text{26}};
\node at (90,125){\text{24}};
\node at (104,125){\text{58}};
\node at (118,125){\text{58}};
}$

Всего полностью обсчитаны хотя бы однократно $ 4 + 26 + 31 + 56 + 66 + 58 + 152 + 96 + 58 = 547 $ паттернов из $1044$ основных. Количество таких паттернов возросло на 40 по сравнению с предыдущей таблицей.

Конкретика по всем 4 ещё не полностью обсчитанным шагам:

(554400-4(66))


(3880800-26(142))


(6098400-31(180))


(42688800-56(226))



Далеко не все сообщают о том, какие паттерны в работе. Блог недоступен, но я вроде разобрался:

(Work)


 Профиль  
                  
 
 Re: Пентадекатлон мечты
Сообщение29.11.2022, 18:13 
Заслуженный участник


20/08/14
11886
Россия, Москва
Huz в сообщении #1571657 писал(а):
pcoul will allocate to an empty spot first. If a solution exists with $p^2qr$ at that empty spot and $2s^2t$ at another spot, we will skip it if $p > Z$ because $p$ "has been checked by sq12".
You're right, the idea doesn't work when placed in empty spaces. Alas.

 Профиль  
                  
 
 Re: Пентадекатлон мечты
Сообщение30.11.2022, 03:24 


05/06/22
293
Dmitriy40 I really don't fully understand why, but implementing (my understanding of) your "two-stage booster" approach so far seems to give very good results - I think even better than you had predicted.

I still have some work to do: first, to prove that it checks everything that it should; second, to modify progress reports and update the recovery process to take the new process into account.

The approach I have used is to add a new parameter "-W<walk_prime>"; it is currently supported only when running for a single pattern, and only with "-p<pmax>". The semantics are: if walk_prime is set, then we first locate the required pattern; then we iterate over each prime $p$ between walk_prime and pmax; for each p, we try allocating $p^2$ at each possible location, and whenever the allocation is successful we directly invoke a linear search (the function walk_v()). Once all $p$ have been tried, we continue with the normal process as if walk_prime was actually the requested pmax.

(For now, it is up to the caller to make sure of other requirements, such as that the number of divisors is a multiple of 3, and that $p^5 > \max$.)

I have tested it only on simpler patterns so far:
Код:
./pcoul -x:9887353188984012120346 -f11 -b1490 -p1e7 -g11 12 11
Without -W: 226.80s
-W5e6: 173.33s
-W1e6: 91.07s
# Note: p^5 < max for the remaining tests
-W1e5: 42.18s
-W1e4: 32.97s
-W1e3: 145.05s
-W9e3: 32.97s
-W9500: 32.74s

./pcoul -x:9887353188984012120346 -f11 -b1481 -p1e7 -g11 12 11
Without -W: 5234.00s
-W1e6: 1980.47s
-W5e5: 1529.46s
-W25e4: 1234.86s
# Note: p^5 < max for the remaining tests
-W12e4: 1081.17s
-W1e5: 1049.12s


Next will be tests with b1619 (LCM 554400), but they will take rather longer.

 Профиль  
                  
 
 Re: Пентадекатлон мечты
Сообщение30.11.2022, 11:29 
Заслуженный участник


20/08/14
11886
Россия, Москва
Huz в сообщении #1571984 писал(а):
I really don't fully understand why, but implementing (my understanding of) your "two-stage booster" approach so far seems to give very good results
It works because we are replacing repeatedly called code with once, albeit with a large amount of iterations. Not the code itself of course, but the interval of its execution. Accordingly it will be useful only on the second and deeper levels of recursion (the first is always called only once) and only if they are called a sufficient number of times (more than 200-400). And the more times they are called, the greater the gain.
And yes, I don't think it makes sense to lower the threshold below 22e3, where the fifth degrees come into play.

 Профиль  
                  
 
 Re: Пентадекатлон мечты
Сообщение30.11.2022, 13:35 
Аватара пользователя


29/04/13
8365
Богородский
Объединённая таблица по данным на сегодня.

$\tikz[scale=.08]{
\fill[green!90!blue!50] (110,160) rectangle (125,210);
\fill[green!90!blue!50] (85,170) rectangle (95,180);
\fill[green!90!blue!50] (0,140) rectangle (140,170);
\fill[green!70!blue!80] (0,120) rectangle (140,140);
\draw  (0,210) rectangle  (10,220);
\draw  (10,210) rectangle  (45,220);
\draw  (45,210) rectangle  (55,220);
\draw  (55,210) rectangle  (65,220);
\draw  (65,210) rectangle  (75,220);
\draw  (75,210) rectangle  (85,220);
\draw  (85,210) rectangle  (95,220);
\draw  (95,210) rectangle  (110,220);
\draw  (110,210) rectangle  (125,220);
\draw  (125,210) rectangle  (140,220);
\draw  (0,200) rectangle  (10,210);
\draw  (10,200) rectangle  (45,210);
\draw  (45,200) rectangle  (55,210);
\draw  (55,200) rectangle  (65,210);
\draw  (65,200) rectangle  (75,210);
\draw  (75,200) rectangle  (85,210);
\draw  (85,200) rectangle  (95,210);
\draw  (95,200) rectangle  (110,210);
\draw  (110,200) rectangle  (125,210);
\draw  (125,200) rectangle  (140,210);
\draw  (0,190) rectangle  (10,200);
\draw  (10,190) rectangle  (45,200);
\draw  (45,190) rectangle  (55,200);
\draw  (55,190) rectangle  (65,200);
\draw  (65,190) rectangle  (75,200);
\draw  (75,190) rectangle  (85,200);
\draw  (85,190) rectangle  (95,200);
\draw  (95,190) rectangle  (110,200);
\draw  (110,190) rectangle  (125,200);
\draw  (125,190) rectangle  (140,200);
\draw  (0,180) rectangle  (10,190);
\draw  (10,180) rectangle  (45,190);
\draw  (45,180) rectangle  (55,190);
\draw  (55,180) rectangle  (65,190);
\draw  (65,180) rectangle  (75,190);
\draw  (75,180) rectangle  (85,190);
\draw  (85,180) rectangle  (95,190);
\draw  (95,180) rectangle  (110,190);
\draw  (110,180) rectangle  (125,190);
\draw  (125,180) rectangle  (140,190);
\draw  (0,170) rectangle  (10,180);
\draw  (10,170) rectangle  (45,180);
\draw  (45,170) rectangle  (55,180);
\draw  (55,170) rectangle  (65,180);
\draw  (65,170) rectangle  (75,180);
\draw  (75,170) rectangle  (85,180);
\draw  (85,170) rectangle  (95,180);
\draw  (95,170) rectangle  (110,180);
\draw  (110,170) rectangle  (125,180);
\draw  (125,170) rectangle  (140,180);
\draw  (0,160) rectangle  (10,170);
\draw  (10,160) rectangle  (45,170);
\draw  (45,160) rectangle  (55,170);
\draw  (55,160) rectangle  (65,170);
\draw  (65,160) rectangle  (75,170);
\draw  (75,160) rectangle  (85,170);
\draw  (85,160) rectangle  (95,170);
\draw  (95,160) rectangle  (110,170);
\draw  (110,160) rectangle  (125,170);
\draw  (125,160) rectangle  (140,170);
\draw  (0,150) rectangle  (10,160);
\draw  (10,150) rectangle  (45,160);
\draw  (45,150) rectangle  (55,160);
\draw  (55,150) rectangle  (65,160);
\draw  (65,150) rectangle  (75,160);
\draw  (75,150) rectangle  (85,160);
\draw  (85,150) rectangle  (95,160);
\draw  (95,150) rectangle  (110,160);
\draw  (110,150) rectangle  (125,160);
\draw  (125,150) rectangle  (140,160);
\draw  (0,140) rectangle  (10,150);
\draw  (10,140) rectangle  (45,150);
\draw  (45,140) rectangle  (55,150);
\draw  (55,140) rectangle  (65,150);
\draw  (65,140) rectangle  (75,150);
\draw  (75,140) rectangle  (85,150);
\draw  (85,140) rectangle  (95,150);
\draw  (95,140) rectangle  (110,150);
\draw  (110,140) rectangle  (125,150);
\draw  (125,140) rectangle  (140,150);
\draw  (0,130) rectangle  (10,140);
\draw  (10,130) rectangle  (45,140);
\draw  (45,130) rectangle  (55,140);
\draw  (55,130) rectangle  (65,140);
\draw  (65,130) rectangle  (75,140);
\draw  (75,130) rectangle  (85,140);
\draw  (85,130) rectangle  (95,140);
\draw  (95,130) rectangle  (110,140);
\draw  (110,130) rectangle  (125,140);
\draw  (125,130) rectangle  (140,140);
\draw  (0,120) rectangle  (10,130);
\draw  (10,120) rectangle  (45,130);
\draw  (45,120) rectangle  (55,130);
\draw  (55,120) rectangle  (65,130);
\draw  (65,120) rectangle  (75,130);
\draw  (75,120) rectangle  (85,130);
\draw  (85,120) rectangle  (95,130);
\draw  (95,120) rectangle  (110,130);
\draw  (110,120) rectangle  (125,130);
\draw  (125,120) rectangle  (140,130);
\node at (4.7,215){\text{1044}};
\node at (28,215){\text{LCM}};
\node at (50,215){\text{3}};
\node at (60,215){\text{4}};
\node at (70,215){\text{5}};
\node at (80,215){\text{6}};
\node at (90,215){\text{7}};
\node at (103,215){\text{Total}};
\node at (118,215){\text{Done}};
\node at (133,215){\text{Work}};
\node at (5.6,205){\text{1.}};
\node at (36,205){\text{554400}};
\node at (50,205){\text{8}};
\node at (60,205){\text{30}};
\node at (70,205){\text{28}};
\node at (104,205){\text{66}};
\node at (90,205){\text{}};
\node at (118,205){\text{6}};
\node at (133,205){\text{5 ?}};
\node at (5.6,195){\text{2.}};
\node at (35,195){\text{3880800}};
\node at (50,195){\text{8}};
\node at (60,195){\text{46}};
\node at (70,195){\text{60}};
\node at (80,195){\text{28}};
\node at (103,195){\text{142}};
\node at (118,195){\text{42}};
\node at (133,195){\text{2 ?}};
\node at (5.6,185){\text{3.}};
\node at (35,185){\text{6098400}};
\node at (50,185){\text{8}};                       
\node at (60,185){\text{46}};
\node at (70,185){\text{78}};
\node at (80,185){\text{48}};
\node at (103,185){\text{180}};
\node at (118,185){\text{36}};
\node at (133,185){\text{1 ?}};
\node at (5.6,175){\text{4.}};
\node at (34,175){\text{42688800}};
\node at (50,175){\text{8}};
\node at (60,175){\text{54}};
\node at (70,175){\text{94}};
\node at (80,175){\text{60}};
\node at (90,175){\text{10}};
\node at (103,175){\text{226}};
\node at (118,175){\text{75}};
\node at (133,175){\text{?}};
\node at (5.6,165){\text{5.}};
\node at (32,165){\text{1331114400}};
\node at (50,165){\text{}};
\node at (60,165){\text{8}};
\node at (70,165){\text{30}};
\node at (80,165){\text{28}};
\node at (104,165){\text{66}};
\node at (118,165){\text{66}};
\node at (133,165){\text{}};
\node at (5.6,155){\text{6.}};
\node at (32,155){\text{8116970400}};
\node at (50,155){\text{}};
\node at (60,155){\text{8}};
\node at (70,155){\text{26}};
\node at (80,155){\text{24}};
\node at (104,155){\text{58}};
\node at (118,155){\text{58}};
\node at (133,155){\text{}};
\node at (5.6,145){\text{7.}};
\node at (31,145){\text{14642258400}};
\node at (60,145){\text{8}};
\node at (70,145){\text{42}};
\node at (80,145){\text{66}};
\node at (90,145){\text{36}};
\node at (103,145){\text{152}};
\node at (118,145){\text{152}};
\node at (133,145){\text{}};
\node at (5.6,135){\text{8.}};
\node at (31,135){\text{56818792800}};
\node at (60,135){\text{8}};
\node at (70,135){\text{38}};
\node at (80,135){\text{40}};
\node at (90,135){\text{10}};
\node at (104,135){\text{96}};
\node at (118,135){\text{96}};
\node at (5.6,125){\text{9.}};
\node at (28,125){\text{19488845930400}};
\node at (70,125){\text{8}};
\node at (80,125){\text{26}};
\node at (90,125){\text{24}};
\node at (104,125){\text{58}};
\node at (118,125){\text{58}};
}$

Всего полностью обсчитаны хотя бы однократно $ 6 + 42 + 36 + 75 + 66 + 58 + 152 + 96 + 58 = 589 $ паттернов из $1044$ основных. Количество таких паттернов возросло на 42 по сравнению с предыдущей таблицей.

Некоторые сомнения вызывает паттерн b628, который не помечен красным. Неужто так быстро был полностью обсчитан?

Конкретика по всем 4 ещё не полностью обсчитанным шагам:

(554400-6(66))


(3880800-42(142))


(6098400-36(180))


(42688800-75(226))



Далеко не все сообщают о том, какие паттерны в работе. Поэтому инфа всего лишь о 8 паттернах:

(Work 8)


 Профиль  
                  
 
 Re: Пентадекатлон мечты
Сообщение30.11.2022, 17:04 


05/06/22
293
Dmitriy40 в сообщении #1572024 писал(а):
Huz в сообщении #1571984 писал(а):
I really don't fully understand why, but implementing (my understanding of) your "two-stage booster" approach so far seems to give very good results
It works because we are replacing repeatedly called code with once, albeit with a large amount of iterations. Not the code itself of course, but the interval of its execution. Accordingly it will be useful only on the second and deeper levels of recursion (the first is always called only once) and only if they are called a sufficient number of times (more than 200-400). And the more times they are called, the greater the gain.
And yes, I don't think it makes sense to lower the threshold below 22e3, where the fifth degrees come into play.

I believe the original code tests only those numbers that need to be tested. So if this approach is faster, I think there are two possibilities: either there is something wrong with the logic, and we are not testing everything that should be tested; or we are managing to test the same things with less overhead.

Proving that the logic is correct is not trivial, but I can display every value tested, and compare similar runs with and without the new option. That may not be practical to do for a complete LCM=554400 run, but shorter runs should be sufficient to decide how much confidence we should have.

If the logic is correct, it implies the overhead in the original code is a higher proportion of the total time than I had anticipated. There are two main areas where this can occur: in the recursion, where we already saved a lot of time with the prime iterators, but maybe there is more that could be saved; and in the linear search, where there is a lot of unavoidable preparation in each call for searches that may only end up testing a small number of values (often 0 or 1).

But if the overhead is so large, there may be more general savings to be made.

When I optimize perl code, I take advantage of a wonderful profiler (Devel::NYTProf) that lets me see very precisely how often each line of code is reached, including the call stack, and how much time has been spent on it. As far as I know the state of the art in profilers for other languages is much more primitive, but something like this would be really helpful to analyse where time is being spent.

-- 30.11.2022, 14:06 --

Yadryara в сообщении #1572040 писал(а):
Некоторые сомнения вызывает паттерн b628, который не помечен красным.

Thanks, that was my error. I have also rearranged how the information is shown (it is all now collected below the main list).

 Профиль  
                  
 
 Re: Пентадекатлон мечты
Сообщение30.11.2022, 17:42 
Заслуженный участник


20/08/14
11886
Россия, Москва
Huz
It's not overhead, it's multiple checks on about the same thing.
And the speed comes from the difference in the methods of checking: in sq12.c for each $p$ all variants of $qr$ are checked, while here for each particular pattern only a linear search is performed, which is very, very often much shorter. This is because the $qr$ search in sq12.c does not depend on the current LCM, but the linear search very much depends on it.

 Профиль  
                  
 
 Re: Пентадекатлон мечты
Сообщение30.11.2022, 18:58 


05/06/22
293
Dmitriy40
I do not believe that the existing code ever checks the same thing twice. (At least, not for 12 divisors.)

It may check both $n$ and $n+1$, which the sq12 test avoids by finding the nearest multiple of 32. But the two-stage process does not do this, it would check both $n$ and $n+1$ if both were valid to check. Indeed, because it tries $p^2$ in each possible position, it may end up duplicating checks when $p_1^2$ in one position and $p_2^2$ in another position coincide - the one-stage code would never duplicate those.

I will know more when I manage to complete the testing I described.

 Профиль  
                  
 
 Re: Пентадекатлон мечты
Сообщение30.11.2022, 21:20 


05/06/22
293
Huz в сообщении #1572067 писал(а):
I do not believe that the existing code ever checks the same thing twice. (At least, not for 12 divisors.)

Apparently I was wrong, I'll need to check that separately (see below) - without the new parameter, testing b1490 with -p1e7 tests 60,793,890 values of which 60,416,363 are unique. If I add -W21818 it tests 5,373,610 values of which 5,349,326 are unique.

Examining the first [1] example of a number tested in the first case but not in the second case (100000018389638352218 [2]), I find that it would have been generated by allocating $73^2$ in the fourth position, then $71^2$ in the second position. Stepping through the two cases, the difference becomes clear: in the second case the number of recursions (which is capped to the maximum prime) is far lower, so in this case it goes through an additional level of recursion (now in the first position), so it never attempts to test the value $100000018389638352218 = 2 \cdot 50000009194819176109$ in the first position since it contains no squared prime. Without -W we find the number of recursions would be higher, so we choose a linear search at this point.

Thus the huge difference in speed is caused largely by making it cheaper to recurse an extra level, coupled with existing logic that takes advantage of that.

Unfortunately that means I cannot verify the logic using this data, I'll need to think of something else.

The duplications, it turns out, are caused by something similar. The first example here is 1000069913781905952218, which is found by allocating $53^2$ in the fourth position. The second position is then $1000069913781905952219 = 3 \cdot 13^2 \cdot 163 \cdot 263^2 \cdot 563 \cdot 310752697$, and when we recurse over the second position we try allocating both $13^2$ and $263^2$ at different times. But as we can see from the numbers at the top, those duplications are less than 1% of the total work.

[1] by ASCII sort
[2] this is the value of the first number in the (possible) chain.

 Профиль  
                  
 
 Re: Пентадекатлон мечты
Сообщение01.12.2022, 07:28 
Заслуженный участник


20/08/14
11886
Россия, Москва
Huz
That's why I said that you do not need to edit pcoul, and make a check separately, for starters. After all, you have already agreed with the reduction of -pZ, and how exactly we reduce it (and for all patterns or for one particular pattern) does not matter.
That almost all checked chains are unique is of course fine, but you have not counted everything: iterator of prime numbers and CRT work not only for checked chains, but for all searched chains as well, which are supposedly more. And this also takes time, after all CRT is not just two or three divisions.
And in general, it is clear that with decreasing -pZ the number of checked chains decreases, much more interesting is where the speed gain comes from sq12.c, not from pcoul with different -pZ, why it can not decrease threshold as fast. And here I think it has to do with a different checking method: sq12.c tries many variants of qr, while pcoul -W tries linearly (i.e. without requiring calling CRT for each iteration), which turns out to be faster in itself. But apart from that, the linear search also depends on the current accumulated LCM (after substitution of simple non-upper levels of recursion), which does not affect sq12.c in any way, but very much affects the linear search.
So I think it's not just about the number of chains checked, even with the same number the gain will still be, although of course less.

 Профиль  
                  
 
 Re: Пентадекатлон мечты
Сообщение01.12.2022, 08:34 
Аватара пользователя


11/12/16
14080
уездный город Н
Некоторые комментарии по текущему процессу.

1. Ещё вчера было выполнено $1/4$ работы, а сегодня уже $1/3$ (это в условных машино-часах).
2. Оптимистичные оценки показывают, что такими темпами всё будет завершено через 8 дней (как раз к 9 декабря :D ).
3. Но это сильно оптимистичные оценки. Паттерны с $LCM=554400$ считаются (на моей стороне, у других может быть несколько быстрее) за 3-4 дня. И если в один поток попадёт 2 таких паттерна (а у меня точно попадут, так как их осталось больше, чем потоков), то это уже 6-8 дней. А если паттерны с $LCM=554400$ начнутся считаться где-то через неделю, то эти потоки и будут задерживать общий результат.

Вывод: в данный момент хорошо бы считать паттерны с $LCM=554400$, а остальное - досчитывать позже, уравнивая время работы потоков.
UPD: к "остальному" относятся паттерны с $LCM \in \left\lbrace 3880800, 6098400, 42688800 \right\rbrace$
Паттерны с $LCM \geqslant 1331114400$ в этом контексте значния не имеют, они считаются слишком быстро, чтобы учитывать это время.

Я планирую к выходным \ на выходных (по завершению текущих заданий) направить 90% мощностей (10-11 потоков из 12) на расчет паттернов $LCM=554400$.

 Профиль  
                  
 
 Re: Пентадекатлон мечты
Сообщение01.12.2022, 16:46 


05/06/22
293
EUgeneUS в сообщении #1572107 писал(а):
Вывод: в данный момент хорошо бы считать паттерны с $LCM=554400$, а остальное - досчитывать позже, уравнивая время работы потоков.

I suggest to wait just a little longer before doing that.

I'm about to release a new version with -W. This will make the patterns with LCM=554400 much faster - but you cannot add -W to a process that has already started, you would need to start it from the beginning again.

So better not to start the longest processes immediately, please wait for the new version.

-- 01.12.2022, 14:02 --

Dmitriy40 в сообщении #1572101 писал(а):
Huz
That's why I said that you do not need to edit pcoul, and make a check separately, for starters. After all, you have already agreed with the reduction of -pZ, and how exactly we reduce it (and for all patterns or for one particular pattern) does not matter.
It would have been much harder to do a separate check, within pcoul I have all the information already available. Almost all the effort went into updating the progress reporting and recovery process, the core loop is very simple:

(Оффтоп)


Цитата:
That almost all checked chains are unique is of course fine, but you have not counted everything: iterator of prime numbers and CRT work not only for checked chains, but for all searched chains as well, which are supposedly more. And this also takes time, after all CRT is not just two or three divisions.
Yes, this is the overhead I was speaking of. There is a third element, calculating the inverses of allocated prime powers as part of the linear search. I think there are opportunities for further optimization in all three of these areas.

 Профиль  
                  
 
 Re: Пентадекатлон мечты
Сообщение01.12.2022, 17:02 
Аватара пользователя


11/12/16
14080
уездный город Н
Huz
Well.
And what are the forecasts for the release of the new version?
If it’s a week or more, then I’ll have time to count a pack of such patterns.

-- 01.12.2022, 17:13 --

FYI. I tried to analyze this code, and did not find any logical errors in it. However, this does not mean that it cannot be.

Huz в сообщении #1572079 писал(а):
Unfortunately that means I cannot verify the logic using this data, I'll need to think of something else.


I can suggest this way:
1. Let's by using -W we reduce the prime threshold from 1e7 to 1e6
2. Let's choose some prime number in this range.
3. Output sets of candidates that were processed with the -W option and the "usual way" that contain the selected prime number
4. and compare them

 Профиль  
                  
Показать сообщения за:  Поле сортировки  
Начать новую тему Ответить на тему  [ Сообщений: 3218 ]  На страницу Пред.  1 ... 178, 179, 180, 181, 182, 183, 184 ... 215  След.

Модераторы: Karan, Toucan, PAV, maxal, Супермодераторы



Кто сейчас на конференции

Сейчас этот форум просматривают: нет зарегистрированных пользователей


Вы не можете начинать темы
Вы не можете отвечать на сообщения
Вы не можете редактировать свои сообщения
Вы не можете удалять свои сообщения
Вы не можете добавлять вложения

Найти:
Powered by phpBB © 2000, 2002, 2005, 2007 phpBB Group